NASA Conference Publication 3251 


Conference on 
Intelligent Robotics 
in Field, Factory, 
Service, and Space 
(CIRFFSS ’94) 

Volume II 

(March 23-24) 


Proceedings of a conference sponsored by 
The American Institute of Aeronautics & Astronautics 
and the National Aeronautics and Space Administration, 

Lyndon B. Johnson Space Center, 
Houston, Texas 

March 21-24, 1994 




The American Institute of 
Aeronautics and Astronautics 


National Aeronautics and 
Space Administration 


NASA Conference Publication 3251 


Conference on 
Intelligent Robotics 
in Fieid, Factory, 
Service, and Space 
(CIRFFSS ’94) 

Volume II 

(March 23-24) 


Jon D. Erickson, Editor 
NASA Lyndon B. Johnson Space Center 

Houston, Texas 

Proceedings of a conference sponsored by 
The American Institute of Aeronautics & Astronautics 
and the National Aeronautics and Space Administration, 
Lyndon B. Johnson Space Center, Houston, Texas 

March 21 -24, 1994 

In cooperation with 
American Association of Artificial Intelligence, 
IEEE Robotics and Automation Society, 
National Service Robot Association, Robotic Industries Association, 
Society of Optical Instrumentation Engineers, 
AIAA Space Automation and Robotics Technical Committee, 
Clear Lake Council of Technical Societies, AIAA Houston Section, 
ISA Clear Lake Section, and IEEE Galveston Bay Section 



National Aeronautics and 
Space Administration 


Preface 


The formation of the AIAA/NASA Conference on 
Intelligent Robotics in Field, Factory, Service, and 
Space (CIRFFSS '94) was originally proposed 
because of the strong belief that America's prob- 
lems of global economic competitiveness and job 
creation and preservation can partly be solved by 
the use of intelligent robotics, which are also 
required for human space exploration missions. It 
was also recognized that in the applications-driven 
approach there are a far greater set of common 
problems and solution approaches in field, factory, 
service, and space applications to be leveraged for 
time and cost savings than the obvious differences 
in implementation details would lead one to be- 
lieve. This insight, coupled with a sense of nation- 
al urgency, made a continuing series of conferences 
to share the details of the common problems and 
solutions across these different fields of application 
not only a natural step, but a necessary one. Fur- 
ther, it was recognized that a strong focusing effort 
is needed to move from recent factory-based robot 
technology into robotic systems with sufficient 
intelligence, reliability, safety, multi-task flexibil- 
ity, and human/machine interoperability to meet the 
rigorous demands of each of these fields of appli- 
cation. The scope of this effort is beyond the 
capability of the private sector alone, government 
alone, or academia alone. Cooperation by all 
interested parties is essential to achieve the needed 
investments and maximize the benefits from 
innovation. 

The first AIAA/NASA conference on intelligent 
robotics is a clear success, judging from the quality 
and number of papers for presentation and manu- 
scripts collected in these proceedings. Also, having 
the proceedings available at the conference is 


important to communication effectiveness and 
efficiency; the authors are to be congratulated for 
meeting the deadline. Having Dr. Joseph Engel- 
berger. Chief Executive Officer of Transitions 
Research Corporation, present the keynote address 
emphasizing the applications-driven approach to 
technology development sets the correct tone and 
background for getting on with the job of strategic 
investment in and development of intelligent 
robotics through cooperative national efforts. 

The papers in these proceedings are evidence that 
users in each field, manufacturers and integrators, 
and technology developers are rapidly increasing 
their understanding of the "whats" and "hows" of 
integrating robotic systems on Earth and in space to 
accomplish economically important tasks requiring 
mobility and manipulation. The 21 sessions of 
technical papers in seven tracks plus two plenary 
sessions cover just the tip of this major progress, 
but reveal its presence nonetheless. 

The contents pages of these proceedings do not 
necessarily reflect the final program nor the ar- 
rangement of presentations in sessions. The con- 
ference brochure provides the information. 

Appreciation goes to the Steering Committee mem- 
bers, Program Committee members. Track chairs, 
and Session chairs who are all so essential to mak- 
ing this a successful conference through the volun- 
tary giving of their time and efforts. Special thanks 
and personal admiration go to Larry Seidman, 

Zafar Taqvi, Hatem Nasr, Mary Stewart, Donna 
Maloy, and Dottye Hamblin for their efforts to 
make this conference happen. 



Jon D. Erickson 
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Abstract 

We present an overview of the autonomous helicopter 
project at Carnegie Mellon’s Robotics Institute. The goal 
of this project is to autonomously fly helicopters using com- 
puter vision closely integrated with other on-board sen- 
sors. We discuss a concrete example mission designed to 
demonstrate the viability of vision-based helicopter flight 
and specify the components necessary to accomplish this 
mission. Major components include customized vision pro- 
cessing hardware designed for high bandwidth and low la- 
tency processing and 6-degree-of-freedom test stand de- 
signed for realistic and safe indoor experiments using model 
helicopters. We describe our progress in accomplishing an 
indoor mission and show experimental results of estimating 
helicopter state with computer vision during actual flight 
experiments. 

Introduction 

Precise maneuverability of helicopters makes them use- 
ful for many critical tasks including rescue and security 
operations, traffic monitoring, mountain fire fighting, 
and inspection of power transmission lines. The goal of 
our project is to build a vision-guided helicopter capable 
of performing these tasks while flying autonomously. In 
addition to robust helicopter control methods, the de- 
velopment of such a system requires research on vision 
algorithms for helicopter positioning and object recog- 
nition necessary for navigation and tracking tasks, to- 
gether with real-time hardware for high speed, robust 
execution of these tasks. 

An autonomous helicopter’s performance is critically 
dependent on accurate and frequent estimates of its po- 
sition and attitude. We focus on methods to provide 
these estimates using on-board cameras closely inte- 
grated with other sensors such as gyroscopes and ac- 
celerometers. 


We have demonstrated our first results on au- 
tonomous helicopter flight. We have built an indoor cal- 
ibrated testbed that allows free flight experiments with 
model helicopters. We have custom designed vision 
hardware which integrates data from on-board sensors 
with real-time image processing and can now achieve 
frame-rate (30 Hz) vision-based state estimation. Inte- 
grating this vision hardware into a stable control sys- 
tem will lead to outdoor autonomous helicopter flight 
for performing useful, practical missions. 

Motivation 

A helicopter is an indispensable air vehicle for emer- 
gency operations, such as rescuing stranded individuals 
and spraying fire extinguishing chemicals for fighting 
forest fires. Uses of helicopters in the electric power in- 
dustry include inspecting towers and transmission lines 
for corrosion and other defects. All of these applications 
demand dangerous flight patterns in close proximity to 
the ground or other objects which can risk pilot safety. 
An unmanned helicopter that operates autonomously 
or is piloted remotely will eliminate these risks and in- 
crease the helicopter’s effectiveness. 

Typical missions of autonomous helicopters require 
flying at low speeds to follow a path or hovering near 
an object. Positioning equipment such as Inertial Nav- 
igation Systems (INS) or Global Positioning Systems 
(GPS) are well suited for long range, low precision heli- 
copter flight and fall short for very precise, close prox- 
imity flight. Maneuvering helicopters close to objects 
requires accurate positioning in relation to the objects. 
Visual sensing is a rich source of data for this relative 
feedback. 

It is difficult, however, to recover helicopter position 
and attitude from vision alone. For instance, distin- 
guishing between rotation and translation in a sequence 
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of images under perspective projection is extremely dif- 
ficult. On the other hand, the new generation of light- 
weight gyroscopes and angular rate sensors in the mar- 
ket provide reliable measurement of angular change in 
an image sequence. For this reason, we concentrate on 
low-level, close integration of such sensors with vision. 


Related Work 

The study of the helicopter control problem is not new. 
Overcoming the inherent instability of helicopters has 
been the focus of a large body of research, includ- 
ing detailed mathematical models (eg., [10]) for con- 
trol and Kalman filtering of multiple sensor data for 
state estimation (eg., [3]). The controller design meth- 
ods range from linear quadratic (LQ) design to 
design [19] and predictive control [8]. For example, a 
stable closed loop control system has been formulated 
[3] by quadratic synthesis techniques for helicopter au- 
tolanding. 

Recently, incorporation of a pilot model has been at- 
tempted based on quadratic optimal Cooperative Con- 
trol Synthesis [17]. This model is used for control aug- 
mentation where the control system cooperates with the 
pilot to increase aircraft performance. The sophisti- 
cated pilot model developed by [7] attempts to describe 
the human’s ability to look ahead, which is crucial to 
precise low- altitude helicopter control. While it is dif- 
ficult to identify and verify these models, they provide 
a valuable basis for an intelligent helicopter controller, 
especially in designing low-level control loops. In this 
project, we employ a set of low-level controllers which 
have been designed by using a simplified helicopter dy- 
namics model. 

Actual flight tests of helicopter controllers have also 
been done. Notable implemented systems include those 
at NASA Ames Research Center [17], NASA Lang- 
ley Research Center [3], and military aircraft manu- 
facturers [5]. Fuzzy controllers have been successfully 
employed for actual helicopter flight experiments. In 
Japan, Sugeno’s group at Tokyo Institute of Technol- 
ogy [14] has demonstrated fuzzy control of helicopters 
for crop dusting. 

The state feedback for the above helicopter con- 
trol experiments was primarily provided by on-board 
INS/GPS or ground-based beacon systems instead of 
on-board computer vision. Recently, we are beginning 
to see promising results in real-time vision-based pro- 
cessors, visual servoing of robotic manipulators, and ac- 
curate vision-based position estimation systems, some 
of which are applicable to autonomous helicopter con- 
trol experiments. 


The development of low cost special-purpose im- 
age correlation chips and new multi-processor architec- 
tures capable of high communication rates has made 
a great impact on image processing. Examples of vi- 
sion systems built from this kind of hardware include 
transputer-based image hardware for two-dimensional 
object tracking [4] and real-time tracking and depth 
map generation using correlation chips [9]. 

The high rate of image processing has made inclu- 
sion of visual feedback in servo loops practical. There 
is significant development in visual control of manip- 
ulators carrying small cameras, eye-in-hand configura- 
tion. Researchers at Carnegie Mellon’s Robotics Insti- 
tute [12] demonstrated real-time visual tracking of ar- 
bitrary 3D objects traveling at unknown 2D velocities 
using a direct-drive manipulator arm. The Yale spatial 
robot juggler [13] demonstrated transputer-based stereo 
vision for locating juggling balls in real time. Real-time 
tracking and interception of objects using a manipula- 
tor [11] has also been demonstrated bzised on fusion of 
the visual feedback and acoustic sensing. 

Controlling by vision requires position estimation rel- 
ative to desired objects and extraction of 3D scene 
structure based on sequence of images. RAPiD and 
DROID [6], developed by Roke Manor Research Lim- 
ited, are systems designed for performing such tasks 
in unknown environments. RAPiD is a model-based 
tracker capable of extracting the position and orien- 
tation of known objects in the scene. DROID is a 
feature-based system which uses the structure-from- 
motion principle for extracting scene structure using 
image sequences. Real-time implementations of these 
systems have been demonstrated using dedicated hard- 
ware. 

Integrating efficient model-based and connectionist 
techniques with powerful hardware architectures has 
produced an array of autonomous land and air ve- 
hicles. Significant advances in autonomous automo- 
biles has demonstrated vision-based control at high- 
way speeds. Most notable are Carnegie Mellon’s 
Navlab [16] project and the work of Dickmanns at Uni- 
versity of Bundeswehn, Munich involved with European 
PROMETHEUS project [2]. 

Dickmanns applies a 4D approach exploiting spatio- 
temporal models of objects in the world to autonomous 
land and air vehicle control [1]. He has demonstrated 
autonomous state estimation for an aircraft in landing 
approach using a video camera, inertial gyros and an 
air velocity meter. Vision-based state estimation is also 
pursued at NASA Ames Research Center [15] using par- 
allel implementation of multi-sensor range estimation 
for helicopter flight. 
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Figure 1: Indoor Testbed 

An electrical model helicopter is supported by six light-weight 
graphite rods. A frictionless air bearing couples each rod with 
two-degree-of-freedom joints mounted on poles seemed to the 
ground. Groimd truth helicopter position is calculated from joint 
€uigles measured by shaft encoders. 

Indoor Helicopter Testbed 

For practical, calibrated experimentation, we have de- 
signed and built an indoor helicopter testbed. It con- 
sists of an electrical model helicopter mounted on a 6- 
degree-of-freedom (6-DOF) test stand (see Figure 1). 
Using the testbed, we can test each critical component 
necessary for autonomous flight before attempting po- 
tentially dangerous outdoor free flight experiments. 

Model helicopters provide an inexpensive, safe, and 
logistically manageable way to experiment with heli- 
copter control. They are faithful reproductions of full 
size helicopters with respect to the crucial rotor controls 
and configurations. Control techniques developed for 
the model helicopters can be directly applied to larger 
scale helicopters. 

The helicopter in our testbed is attached to a fric- 
tionless 6-DOF stand as shown in Figure 1. The stand 
provides ground truth measurement of the helicopter 
position and attitude, and also works as a safety de- 
vice preventing crashes and out-of-control flight. The 
helicopter on the stand can fly freely in a cone-shaped 
volume six feet wide and five feet tall without major 
inertial variations from free flight. The helicopter is 
fastened to six fixed poles by six light-weight graphite 



Figure 2: Testbed System Configuration 


rods. Each graphite rod is free to move through a fric- 
tionless air bearing mounted on a two-degree-of-freedom 
joint. The joint angles are measured by shaft encoders 
and used by the computer to calculate the helicopter’s 
ground truth position and attitude for experiment eval- 
uation. 

The computer system configuration, shown in Fig- 
ure 2, consists of a host computer, customized vision 
processor, a real-time processor, synchronization hard- 
ware, and interfacing equipment. A hand-held radio 
transmitter used by a model helicopter pilot is inter- 
faced to real-time computers. Using this interface, we 
can send computer control signals to the helicopter. 
The same interface can be used for free flying heli- 
copters. 

With this testbed, we can perform controlled exper- 
iments over a wide range of conditions. We can create 
various wind conditions by using fans, terrain condi- 
tions by placing objects, and helicopter setups by ad- 
justing the mechanisms. Because of the safety provided 
by the testbed, even potentially disastrous situations 
like the failure of critical helicopter parts can be tested. 

Using a simplified helicopter dynamics model we have 
implemented a control system capable of hovering the 
helicopter using linear controllers tuned at different op- 
erating points. This control system provides us with a 
stable platform necessary for conducting low-speed and 
hovering experiments. 

One apparent limitation of the test stand is its in- 
ability to support larger model helicopters capable of 
lifting several sensors at once. On the other hand, since 
the test stand provides ground truth data, we can sim- 
ulate data from certain sensors by purposely corrupting 
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Digitizer Configuration 

The helicopter has multiple on-board sensors: two 

ground-pointing black and white CCD video cameras, 
vertical and directional gyroscopes, and accelerometers 
for each translational axis. The data from these sensors 
is digitized using multiple special-purpose digitizers. In 
particular, our system provides variable sampling rates 
for image digitization. Typically, the NTSC video signal 
is sampled at 14.3 MHz which yields close to 1000 pixels 
per line. Conventional video digitizers choose 512 or 640 
of these pixels per line during digitization. Since most 
CCD cameras have less than 1000 CCDs per line, we 
directly control digitizer sampling to reduce image data 
bandwidth and to provide more original image content. 
The aspect ratio of the image changes with sampling 
frequency and must be properly calibrated. 


Figure 3: Vision Processor Structure 

the stand data before using it. Different sensors can be 
individually characterized by comparing their response 
with ground truth data and their presence on-board the 
helicopter can be simulated during experiments. 

Low Latency Vision Hardware for 
Helicopter Control 

Our experience controlling model helicopters using the 
test stand has shown the necessity of velocity and po- 
sition feedback rates of 15 to 30 Hz. Processing image 
data at these rates requires fast computers capable of 
acquiring and processing images at frame-rate (30 Hz). 
There are a number of new cost-effective compact CPU 
platforms designed for high speed data transfer and pro- 
cessing. Among the most popular are: SGS-Thomson 
inmos T9000 Transputer, Intel i860, and Texas Instru- 
ments TMS320C40 Digital Signal Processor (C40). Our 
development is based on the C40 platform primarily 
for its high speed communication ports each capable 
of transferring data at 20 MB/s. Other advantages in- 
clude: programmable Direct Memory Access (DMA) 
well-suited for image windowing operations, flexible 
memory architecture and internal bus structure, and 
wide availability. The structure of our customized vi- 
sion processor is shown by Figure 3. 

We have achieved close integration of vision with 
other on-board sensors using customized hardware de- 
signed to interface with an array of C40 processors. 
This low-level integration is key in providing robust ve- 
locity and position estimation. 


Convolution and Image Tagging 

Fast convolution is essential for image preprocessing. 
In addition to edge detection and smoothing, matching 
and feature extraction can be performed using special 
convolution masks. We use real-time convolution hard- 
ware to perform Gaussian smoothing before processing 
images. To reduce image data bandwidth, we subsam- 
ple the image using the digitizer before performing the 
smoothing operation. For the experiments described in 
this paper, 8x8 convolution masks were used on images 
sampled at 6 MHz pixel frequency. 

Using similar convolution hardware, accelerometer 
and gyroscope data are sampled at 120 Hz and fil- 
tered by 64x1 Gaussian FIR filters. The filtered data 
is sampled and incorporated in the image data stream 
by an image tagger. Precise temporal matching of this 
data with the image is performed by using the camera’s 
60 Hz field vertical sync clock (VSYNC) and shutter 
speed. We use 1 millisecond shutter speed for tagging 
images accurately and reducing image blurring during 
helicopter motion. 

High Speed Data Link 

Because of the camera’s VSYNC frequency, the pro- 
cessing time period for the sensor-tagged field images 
can only be multiples of 16.7 milliseconds. Barely miss- 
ing an image due to long processing time is expensive 
since the processor must wait for a new image for proper 
synchronization. Image field digitization alone requires 
16.7 milliseconds. During this time period the image 
must be transferred to the processor in order to achieve 
frame-rate (30 Hz) performance. We perform this trans- 
fer through a high speed data link designed to commu- 
nicate with C40 processor comm-ports. This link incor- 
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porates small hardware buffers to convert the incoming 
synchronous image stream to the asynchronous comm- 
port protocol of the C40. In addition, since the image 
data is not directly entering a frame buffer, the high 
speed link provides proper comm-port synchronization 
with the camera using an internal state- machine. The 
comm-port design reduces CPU memory bus traffic by 
using C40’s internal data buses and provides the ability 
to only transfer regions of interest using C40s versatile 
DMAs. These functions are crucial in improving pro- 
cessor speed. 


Search Mission 

As a concrete mission for an autonomous vision-guided 
helicopter, we envision a task of locating a known ob- 
ject in a predetermined outdoor area, for example, a 
particular car in a parking lot, and tracking the object 
by controlled helicopter flight. 

The development of the indoor test stand allows us to 
conveniently simulate search mission scenarios using a 
variety of objects and terrain for visual tracking experi- 
ments. By carefully choosing these indoor experiments, 
we expect similar performance outdoors. The differ- 
ences in flight altitude and terrain illumination can be 
resolved by small modifications to camera lenses, shut- 
ter speeds, and digitizing hardware. 

Our mission is to search for a small car stranded 
somewhere in rough terrain. Performing this task re- 
quires object recognition to find the car, and visual 
measurement of position and velocity for autonomous 
flight. We have covered the stand base with gravel col- 
lected from the outdoor mission site to provide a real- 
istic scene for our vision algorithms. 

Velocity and Position Measurement 

To measure helicopter velocity or position based on im- 
age data, we must first determine the displacement be- 
tween consecutive images. This displacement in cam- 
era pixel coordinates is a function of camera attitude 
and distance relative to objects in the scene and cam- 
era calibration parameters such as focal length. For 
the indoor search experiments, camera attitude is es- 
timated by gyroscopes and camera distance from the 
ground is estimated using the test stand. Performing 
outdoor experiments without the test stand requires al- 
titude measurement by stereo vision possibly integrated 
with a laser rangefinder or microwave radar system. 

The apparent displacement between consecutive im- 
ages is a result of camera translation and rotation. Dis- 
ambiguating rotation from translation is especially im- 
portant for helicopter control since helicopter transla- 



Figure 4: Effect of helicopter rotation 


tion is directly a result of its change in attitude. Fig- 
ure 4 shows the significance of this effect while the he- 
licopter flares for reducing forward speed or stopping. 

By carefully measuring the angular change between 
templates and images, we can estimate the effect of ro- 
tation and correct the image displacement to only re- 
flect translational motion. This correction is useless 
without precise synchronization of gyroscope data with 
images. The drift common to all gyroscopes is not a 
problem here, since only the change in attitude is nec- 
essary from frame to frame. 

Image Displacement Measurement 

We use template matching to measure the displacement 
between consecutive images. We use sum-of-squared- 
differences (SSD) as our matching criteria. Each tem- 
plate is an m X n window of image intensities selected 
from the previous image. The best match of the tem- 
plate in the image can be determined by minimizing the 
SSD of the template and image pixels. To reduce the 
amount of computation, we restrict our search area to a 
small window around the template’s neighboring pixels. 
The size of this search area is determined by helicopter 
altitude and anticipated worst case change in helicopter 
motion within one frame period. As the helicopter al- 
titude decreases, the same translational motion causes 
a larger displacement in the image. The minimum al- 
titude of the test stand is 1 meter and the on-board 
camera lens has 7.8 mm focal length. If we allow max- 
imum helicopter velocity to be 1.5 meters per second 
during hover, our maximum image template displace- 
ment is 32 pixels per frame. 

A coarse to fine strategy further improves search area 
and speed. We begin by using every fourth pixel to 
produce a coarse match for narrowing the search to 64 
possible pixel locations. This estimate is improved to 
subpixel accuracy by fitting a parabolic surface to the 
SSD of the 64 match candidates. Figure 5 shows an 
example of a fitted parabola. A good parabola fit will 
refine the best single pixel match within ±1 pixel. The 
parabola minimum is disregarded if it is not within one 
pixel of the single pixel match. 
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Figure 5: SSD Parabola Fit 


In addition to subpixel accuracy, the fitted parabola 
provides match uncertainty information. A steep 
parabola versus a shallower one signals a more accurate 
match. Covariance matrices constructed from parabola 
coefficients will allow us to combine data from each tem- 
plate using a Kalman filter to produce the best estimate 
of image displacement. 

For experiments reported here, we use four image 
templates for velocity and one template for position es- 
timation as shown by Figure 6. The four velocity tem- 
plates are 40 x 40 pixels in size and are positioned in 
each image quadrant. After each matching operation, 
the displacement of each template is calculated and the 
templates are updated with new image data from the 
same location. 

Actual velocity measurement during flight is shown 
by Figure 7. This figure compares ground truth lateral 
and longitudinal velocity measurement (solid line) from 
the test stand with vision-based velocity estimates. The 
dashed and dotted lines in each figure represent vision- 
based velocity estimates with and without attitude cor- 
rection. The correction was performed by measuring 
the attitude change between each template-image pair. 
Assuming images are taken from a locally flat surface, 
we can construct a transform, based on helicopter alti- 
tude and camera focal length, to convert the attitude 
change to a correction vector on the image plane for 
each template location. The effect of this correction is 
significant: 33 cm/s RMS error in lateral velocity mea- 
surement without attitude correction versus 5 cm / s af- 
ter correction. 

The position estimation template is 64 x 64 pixels 
in size and its location varies as the helicopter moves. 
This template is updated with image data from the best 
match in order to compensate for changes in helicopter 



Figure 6: Image Templates 
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Figure 7: Vision-Based Velocity Measurement 

The solid lines represent ground truth helicopter velocity from 
the test st^ind. The dotted lines show velocity b£ised on image 
displacement 2done ^md the dashed lines represent vision-based 
velocity with attitude correction. 
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Figure 8: Position Measurement 

The solid line represents ground truth helicopter position from 
the test stand. The dotted line shows position based on image 
displacement . 

altitude and heading. If the best template match is 
close to leaving the camera view, the position template 
is loaded from the image center. A larger search area of 
64 pixel displacement is used due to longer processing 
time. Figure 8 shows vision-based (dashed line) and 
ground truth (solid line) lateral position with respect 
to camera starting point. Attitude correction is more 
complicated in this case since the template changes po- 
sition in the image plane. The figure shows uncorrected 
position estimation. 

Position and Velocity Data Flow and Synchro- 
nization 

We can not overemphasize the role of accurate synchro- 
nization in integration of on-board sensor data with 
high speed image processing. As observed above, atti- 
tude correction by synchronizing image and gyroscope 
data produces a significant improvement on position 
and velocity measurement accuracy. Figure 9 shows 
the data flow and synchronization we are performing 
for above helicopter motion estimation. 

The solid vertical lines represent the camera VSYNC 
from the second image field (B). For high speed per- 
formance, only one image field (A) is used for mo- 
tion estimation. The process begins with opening the 
camera shutter for 1 millisecond prior to VSYNC. Fil- 
tered gyroscope and accelerometer data is sampled with 
VSYNC and included in the image data stream by the 
image tagger. The tagged image is transferred to C40-1 
which partitions field A for other C40s. The top half 
of field A is used by C40-1 and the bottom half by 
C40-2 for velocity estimation. In addition, field A is 
transferred to C40-3 for position estimation. Due to 
the high band-width of connections between C40s, it is 


Figure 9: Data Flow and Synchronization 

possible to start image processing during image trans- 
fer. The transferring is performed by DMAs which do 
not interfere with data processing. C40-1 also trans- 
fers synchronized gyroscope and accelerometer data to 
C40-4 which is responsible for state estimation and con- 
trol. The state estimation is performed by transforming 
image displacement data from other C40s to helicopter 
translational motion. The estimated translational ve- 
locity and position in conjunction with accelerometer 
and gyroscope data are used by linear control loops to 
control the helicopter. 

Object Search 

The vision-based velocity and position estimation pro- 
vides the basic capability for hovering and low-speed 
flight necessary for our indoor search mission. Locating 
the object of interest is the next step. We use template 
matching to perform this search. A major difficulty 
in this approach is that object orientation is unknown. 
This requires templates of the object in all possible ori- 
entation for matching. Methods such as K-L expan- 
sion [18] can be used to reduce computational complex- 
ity and storage of necessary templates. Another prob- 
lem stems from varying helicopter altitude which will 
change the size of the object in the image. Close regula- 
tion and measurement of helicopter altitude is necessary 
to further reduce the complexity of the search. 

We are conducting the search using a set of twenty 
32 X 32 templates. These twenty templates, generated 
by K-L transform techniques, are sufficient for locating 
objects with ±40® orientation discrepancy as accurately 
as one degree resolution. The processing frequency for 
searching the entire image is 3 Hz using one C40 proces- 
sor. Upon locating the object, the position estimator 
can now use the object in the image as its template pro- 
viding relative helicopter position necessary for object 
tracking. 
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Conclusions 

We have successfully developed the key components 
necessary for vision-guided autonomous flight. As our 
experimental results demonstrate, we are achieving 
real-time low latency image processing at suitable rates 
to stably fly helicopters. The major elements in our de- 
velopment have been custom designed vision hardware 
and indoor testbed. In addition to high speed process- 
ing, customized hardware provides flexible integration 
of on-board sensors which significantly improves vision- 
based state estimation. The indoor testbed provides 
convenient calibrated experimentation which is essen- 
tial in building real autonomous systems. 
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Abstract 

Real-time tracking of multiple targets or geometrical 
features of an object using a variable resolution laser 
scanner is presented. The scanner is characterized by 
its robustness to ambiant illumination, even the sun 
shining directly into the sensor, and its tracking 
resolution. The sensor to extract registered range and 
intensity information for each scanned point on the 
object at a rate of 18 kHz uses two high-speed 
galvanometers and a collimated laser beam. Three- 
dimensional real-time tracking using Lissajous 
patterns proves to be very attractive for space 
applications. Integration with the existing 
photogrammetry-based Advanced Space Vision 
System (ASVS) is discussed. 


Introduction 

It is now well accepted that vision will play a major 
role in both supervised and unsupervised operations 
for the automation of several space-related activities. 
Specifications state that the Artificial Vision Unit will 
support rendez-vous and proximity operations 
including payload tracking, capture, and berthing in 
both teleoperation and adaptive control modcs.’’^ It is 
also recognized that vision will play a major role 
during the assembly and maintenance operations of 
the space station. 

The current Artificial Vision Function (AVF) is based 
on the Space Vision System (SVS) that was 


demonstrated on CANEX-2. The SVS analyzes video 
signals from the closed circuit television system of 
the space shuttle and provides real-time position and 
orientation information about an object with a 
cooperative target array The baseline requirements 
for the Artificial Vision Function are as follows: 

- To identify a suitably illuminated object (target) 
from its video image. 

- To estimate the position, attitude, translation, and 
rotational rate of the object. 

- To provide appropriate camera control to track 
the object. 

- To be able to track objects before capture by 
manipulators and berth objects/payloads handled 
by manipulators. 

To achieve robust adaptive control, supervision, and 
inspection, the vision system must be 100% 
operational throughout the changing illumination 
conditions in orbit. Unfortunately, the quality of the 
images produced by standard video-camera-based 
systems is adversely affected by the presence of the 
sun or any other strong source of light. Poor contrast 
between features on the object and background, and 
saturation of the photo-detector array often make it 
difficult to analyze the video images. Video camera 
reliability during normal operation is questionable and 
can compromise the success of the operation or at 
least seriously limit its practical use. 

Although camera-based systems are very attractive 
because of their ease of use and simplicity of 
integration with existing equipment, it is highly 
desirable to offer a complementary vision system that 
will not be restricted by operational conditions such 
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as sun interference. The proposed laser scanner 
approach offers the advantage of being almost 100% 
operational throughout the changing illumination 
conditions in orbit. The technique, developed at NRC 
in collaboration with the CSA, is designed to be 
insensitive to background illumination such as the 
earth albedo and the sun radiation and most of its 
reflections.^'^ Immunity to background interference is 
the result of the very narrow band of frequencies 
emitted by the laser light that can be tuned by optical 
filters, by special optical design to extract range 
information, and by proprietary real-time signal 
processing techniques used to distinguish between the 
laser beam and any remaining sources of interference 
(e.g., specular reflections). 

The Laser Scanner 

Figure 1 shows the geometry of this prototype based 
on the auto-synchronized time-flying single-point 
technique.'' The system is comprised of two 
orthogonally mounted scanning mirrors: the X-axis 
scanning mirror, used for range triangulation, and the 
Y-axis mirror. The two mirrors are driven by accurate 
galvanometers. The light beam generated by the laser 
is deflected by a mirror and scanned on the object. A 
camera, consisting of a lens and a position-sensitive 
photodetector, measures the location of the image 



Figure 1: Schematic representation of the dual- 
axis auto-synchronized range scanner (reprinted 
from Reference 7). 


illuminated point on the object. By simple 
trigonometry, the three-dimensional coordinates of 
that point are calculated.^’ 

Some of the advantages of the auto-synchronized 
technique are a large field of view, high accuracy, 
and immunity to ambiant illumination.* A 
conventional camera must continuously monitor the 
whole field of view, making it very susceptible to sun 
interference. The laser scanner technique has a small 
instantaneous field of view limiting the undesired 
interference. Such a system is optically light efficient 
because the received laser power is focused onto a 
small spot thereby increasing the signal-to-noise ratio 
of the returned signal. Automatic control of the laser 
power source further extends the dynamic range of 
the scanner. 

Figure 2 illustrates the equivalent optical geometry. 
The basic concept is that the projection of the light 
spot is synchronized with its detection. The 
instantaneous field of view of the position sensor 
follows the spot as it scans the scene. An external 
optical perturbation can interfere with the detection 
only when it intercepts the instantaneous field of 
view of the scanner. At this level, electronic signal 
processing is used to filter these false readings to 
obtain the correct 3-D measurement. The total field of 
view of the laser camera is related only to the 
scanning angles of the galvanometers and mirrors as 
opposed to a conventional camera where field of view 
and image resolution are intimately linked, the larger 
field of view the smaller pixel resolution achievable. 



Figure 2: Basic principle of synchronized 
scanner, second axis not shown (reprinted from 
Reference 8 ). 
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Although laser scanners are usually slower than their 
TV camera counterparts when used in a conventional 
imaging or raster scan mode of operation, the same 
laser scanner can obtain refresh rates of more than 
130 Hz with a pointing resolution of 15000 elements 
X 1 5000 elements in the tracking mode (single target). 
Consequently, the resolution and speed of the laser 
scanner exceed conventional video cameras. The high 
pointing accuracy of a laser beam and the large depth 
of field of the scanner enable the acquisition of large 
3-D images of 4000 pixels x 4000 pixels or more. To 
achieve similar resolution, a video camera would 
need a high-quality optical lens and specially 
designed high-resolution CCD cameras. 

A custom-designed galvanometer controller board 
gives access to any pixel in the field of view of the 
range sensor and provides the random access tracking 
capability.^’"* The controller permits interrogation of 
any point in the scene without having to sample the 
entire field of view of the scanner. This Region Of 
Interest (ROI) sampling technique offers great 
potential for rapid and efficient acquisition of dense 
and accurate 3-D images over a large volume of 
measurement. 

Figure 3 is a photograph of the laboratory prototype 
of the auto-synchronized laser scanner developed for 



Figure 3: Photograph of the laser scanner 
prototype. 


this application. Tables I and II and Figures 4 and 5 
summarize the characteristics of this laser scanner 
prototype in the triangulation mode of operation. 



Figure 4: Range resolution (single point). 



Figure 5: Angular pointing resolution (tracking 
mode using Lissajous figures). 


Real-time Tracking 


Real-time tracking of targets or geometrical features 
on an object is implemented using Lissajous figures, 
to obtain good scanning speed and accuracy^ A 
Lissajous figure is mathematically defined by 

0 ( t) =AqCos (/no) t+ 60 ) + 0Q (1) 

4>(t) =A^cos(no) 6^) + 4>o (2) 

where Aq and are the amplitudes of the Lissajous 
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Table I: Physical characteristics. 


Dimensions 

10.5 in X 6.25 in x 4 in 

Data Type 

3-D and Intensity 

Field of view 

30° X 30° 

Working range 

0.6 ft to TBD (>150 ft) 

Range accuracy 

1 See Figure 4 

Acquisition Speed 

i 18 kHz 

Scanning System 

j Galvanometers 

Resolution X-Y 

1 1 / 15000 


pattern, co the refresh rate of the scan, 5e and 8^ the 
relative phases of the sinewaves, t the sampling time, 
and ©0 and the position of the center Lissajous 
pattern. The pattern shape is defined using parameters 
m\n. For example the pattern illustrated in Figures 6 
and 7 is a 3:2 Lissajous figure. 


Lissajous 

Scan 



Tracking 

Error 


Center 
of target 


Figure 6: Tracking of a circular target using a 3:2 
Lissajous pattern. 


Lissajous patterns are used to scan objects at refresh 
rates exceeding the frequency response of the 
mechanical deflection system. The scanning device is 
not required to stop the acquisition after each trace 
line as opposed to the raster-type scan illustrated in 
Figure 8. Instead, the full pattern is used to acquire 
both range and intensity information. Furthermore, 
the natural inertia of the galvanometer-mirror 
structure smooths the scanning pattern and hence 
increases the pointing accuracy of the tracking 



Figure 7: Real-time tracking of geometrical targets 
(e.g., corner or targets) using Lissajous patterns 
and the laser range sensor. 



Figure 8: Conventional raster-type acquisition of 
the 3-D shape of the object. 

system. As well, Lissajous pattern signals are 
optimally filtered using the Fourier transform (or 
notch filter), thus reducing electrical noise, 
quantization effects, and distortions.^ 

Tracking is implemented using the 3-D range 
information on the Lissajous pattern, the returned 
intensity signal from the object, or a combination of 
both. For example, the location and orientation of the 
geometrical target is obtained with either the 3-D or 
intensity edges of the object or the intersection of the 
measured surfaces on an object. 
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Table II: Performances characteristics of the laser scanner. 


Laser Scanner 


Video Cameras 


General 

- Excellent immunity to ambiant illumination 

- High-resolution images and large depth of view 

- 3-D and registered intensity images 

- New technology not as widely used 

Optics 

- Optically efficient, excellent signal-to-noise ratio 
because all the laser energy is concentrated in a 
single point 

- Simple on-axis optics 

- Large depth of view (max/min range) and small 
instantaneous field of view 

- Optical magnification produces good range 
resolution and large images of the sun to simplify 
the signal processing and reduce saturation 

- Laser light easily filtered using narrow-bandwilh 
optical interference filters 

- Eye safety requirements for terrestrial applications 
(eye-safe if laser at 1.5 pm is used) 

Scanning devices (galvanometers) 

- Large scanning angles, high pointing resolution 

- Randomly addressable, almost any scanning 
patterns can be programmed 

- Mechanically moving devices (relatively slow) 

- Temperature sensitive* 

Electronics 

- Simplify the processing of the CCD image 

- Further improve the immunity to ambiant 
illumination 

- Modular architecture easily upgradable 

- Random control and real-time implementation 
requires good knowledge of the dynamics of the 
sensor and the application** 


- Highly susceptible to ambiant illumination 

- Low hardware cost 

- Use on-board closed-circuit video cameras system 


- Simple camera configuration 

- Work typically under ambiant illumination or 
projectors 

- Adversely affected by ambiant illumination and sun 
interference 

- Large instantaneous field of view makes the system 
susceptible to saturation 

- No optical filtering possible because of the use of 
ambiant illumination (if laser projectors are used 
large-bandwith optical filters are required) 

- Compromise between optical magnification and 
field of view (focal length) and sensitivity and 
depth of view (f-number) 


- No moving parts except if rotation stage is used for 
camera pointing 

- Mechanically rugged systems 

- Temperature sensitivity is reduced by years of 
design and mature technology 


- 2-D image processing required at video rate 

- Special technique still to be developed to reduce 
the effect of changing illumination on the targets 

- Modular low-cost electronics 

- All solid-state 


* Temperature compensation is implemented using synchronization photo-detectors.*^ 

** This is valid for any accurate measurement system. 
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Figure 6 illustrates the tracking principle using a 
Lissajous figure. The target can be a cylinder, a 
sphere, or simply a colored target . For long-range 
measurement a retro reflective target (corner cube or 
3M retroreflective tape) is used. Assuming that /(/) 
and Z(t) are, respectively, the intensity and range 
measurements for each point of the Lissajous pattern 
defined by equations 1 and 2, the tracking error on 
the target is computed from the centroid 
measurements weighted by the intensity of the 
returned laser signal: 

if (J(t)>J„f) and (3) 
(Ar„i„< (r(t) -Zo) <Ar^J 
= 0 otherwise 


<®> 

5= [r,0,4>] 




ffpr^= 


Y 


( 10 ) 


C<1 (11) 




(4) 


E iv.i ( t) 


(5) 


E ” 


( 6 ) 


If a target is lost then 

Ar= A0 = A(|) =0 

if (E^v,i(t)=0) 


(7) 


0o» ^o» ''o the position of the scan and consequently 
the expected position of the target. The intensity 
threshold and range validation window and 
are specific for a target type. Any point outside 
the range/intensity window is considered not valid 
and disregarded. Range is extremely useful in 
equation 3 to make the technique very robust to 
external perturbations (e.g., r=«> for the sun) 


is the z-transform delay of one sample. is the 
direct feedback loop internal to the laser scanner, 
^prcd the predicted or expected position of the 
target, and is an external user control 

adjustement. implements a classical deadbeat 
controller reducing the tracking error to zero such that 
the Lissajous pattern is always centered on the target. 
The predicted or expected position is used to 
give an estimate of the position of the target. This 
estimate is obtained either from the known or 
calculated position of the object or from the ASVS- 
laser scanner interface, //^j allows an immediate fine 
adjustment of the target position. This is especially 
useful in the search mode when the target is not 
found (equation 7) and the predicted position is not 
accurate enough (e.g., during initial search) to lock 
the laser scanner pattern on the target. External user 
control is used to move the pattern over the target. 
Equation 11, a lossy integrator, gradually removes 
this external correction and is replaced by the 
correction 

The laser scanner automatically adjusts the size of the 
scan according to the distance of the target from the 
camera: 





( 12 ) 


Tracking is implemented in the spherical coordinate 
system (r,0,(t)) of the laser scanner using the digital 
feedback structure defined, using the z-lransform by: 


For each target, a set of parameters A 
is defined. Optimum coverage of the surface of the 
target and pointing resolution are obtained because 
the Lissajous pattern is always centered and scaled on 
the target independently of the distance of the target. 
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Multiple Tracking of Taiigets, Photogrammetiy, 
and the Laser Scanner 

The photogrammetry mode of operation of the ASVS 
to be used on board the space shuttle' and the laser 
scanner are compatible and easily interfaced; the 
deflection angles of the galvanometers directly 
provide the angular separation between the targets. 
Then, with photogrammetry techniques, the object 
position X-y-Z and attitude parameters yaw-pitch- 
rotation are computed by minimizing the quadratic 
error between the expected position of the targets 
computed from the model of the object and the 
measured positions provided by the laser scanner.^ 

Real-time and physical constraints caused by 
switching quickly between multiple targets and 
limited by the inertia of moving mechanical devices 
must be considered, based on the application 
requirements. A careful investigation of the complete 
mechanical, optical, and electrical characteristics of 
the hardware and software is needed to achieve 
optimum tracking performance. 

The laser scanner is programmed to sequentially scan 
different sections or targets on one or multiple 
objects, as illustrated in Figures 7 and 9. A list of all 
possible scanning strategies used for a given task are 
pre-defined. Each strategy contains a list of the 
visible targets to scan and is optimized for a given 
application. The laser scanner uses this list to 
sequentially scan the targets on the different objects. 

A scenario under study for the assembly of the space 
station is the berthing of the LAB/A Module on the 
APN (Aft Port Node) CBM interface in the Space 
Station using the Remote Manipulator System (RMS) 
as graphically illustrated in Figure 9. The vision 
system task is to provide the position and orientation 
of the LAB/A module relative to the APN and ITA 
(Integrated Trust Assembly, not shown) modules as 
well as visual guidance during berthing operations. 

The visible and nonvisible targets on the two main 
objects are: 

Targets Module 

T, to T« LAB/A 

T^to T,6 APN (Aft Port Node) 

Only the visible targets T, to T4 and Ty to T,2 are 
shown in Figure 9. 



Figure 9: Example of tracking for the assembly of 
the space station. 

The following strategies are defined for this visible 
section of the objects: 

Strategy = Scanning Target List 

Sq = T,,T2 ,T3,T4 

S| = T9,T|o,T,,,T|2 

S2 = Sq, T9, Sq, T|q, Sq, T|,, Sq, T,2 

It is important to acquire fast moving targets as often 
as possible because of the sequential nature of the 
scanning method. The basic scanning strategies SQand 
S, scan only the targets on their respective objects. 
They are used to initialy locate a relatively fast 
moving object. When the object is locked by the 
scanner, the scanning strategy is switched to S2 to 
measure the relative position of the LAB/A module 
with respect to the whole structure. In this example 
the APN module is considered to be almost stationary 
and therefore S2 instructs the laser scanner to scan the 
moving LAB/A module more often than the other 
objects. All the targets from the LAB/A module are 
scanned (Sq) followed by one target from the APN 
module. Although only three strategies are shown, 
any number can be defined, depending on the 
application. 

The approach used for object tracking allows the 
operator to fully define the tasks required by the 
tracking system. Instead of having a single "target or 
object" detection algorithm controlling the scanner, 
the user can dynamically tune or replace any element 
for a given application. New modules are easily built 
to improve system performances. 
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The tracking refresh rate for the LAB/A module is 
currently limited to 15 Hz for this experiment. 
Geometrical detection and tracking are 
computationaly expensive and need more processing 
power to achieve faster tracking speed than the 
current single 68020 VME processor can provide. 
Integration of parallel processing using multiple 
TMS320C40 DSP is planned. Furthermore, several 
key elements of this system, including geometrical 
target localization algorithms, absolute real-time 
calibration (vs. relative calibration), and optimized 
corrector-predictor tracking techniques, are only at 
their preliminary stage of study. As the system 
improves in performance, further modifications can 
be introduced without redesigning the whole software 
or hardware environment. 


Laser Scanner and Three-dimensional Imageiy 

A more conventional imagery method can also be 
used for tracking, at lower speed than the Lissajous 
technique. Three-dimensional imagery using a laser 
scanner has the major advantage of creating high 
resolution images of the object under inspection. 
Images of more than 4000 points x 4000 points are 
easily acquired Figure 8 illustrates the raster-type 
acquisition mode, similar to conventional video 
cameras except that 3-D range information and 
intensity, in perfect registration, are obtained for all 
the points scanned in the image. Resolution is also 
higher than with conventional CCD cameras, as 
discussed previously. It is usually assumed that an 
object is relatively stationary with respect to the 
scanner. 

Figure 10 shows a 3-D raster image of a copy of the 
CTA testing module used during testing of the SVS 
system on board Space Shuttle Flight STS-52. It is 
used here for evaluation of geometrical tracking 
algorithms. Other scanning methods have also been 
proposed based on the random access laser scanner 
discussed here.'® 


Conclusion 

Real-time tracking of targets on an object and 
acquisition of high-density three-dimensional images 
have been demonstrated using a random access 3-D 
laser scanner. The prototype has the potential of 
automatically tracking, in three dimensions, either 
geometrical features of the object itself or targets 



Figure 10: Raster type image of a model of the 
CTA used during space shuttle flight STS-52: 

(top) intensity image, (middle) range image 
encoded in grey levels , (bottom) pseudo shaded 
(derivative) of the range image (texture). 

attached to it. Excellent immunity to ambient 
illumination and sun interference is obtained. 

Real-time tracking of targets on an object is realized 
using Lissajous-type scanning patterns or raster type 
images, depending on the speed of the moving obje 
ct. The Lissajous scanning figures give high pointing 
resolution, excellent stability, and good object 
position refresh rate. 




Tracking error feedback is simultaneously obtained 
from three different sources: (1) direct tracking error 
based on each individual targets, (2) global predicted 
target position based on the calculated estimate of the 
object position, and (3) external feedback control 
from the human operator. Tracking of multiple targets 
using Lissajous patterns is based on user defined 
scanning strategies pre-programmed in the laser 
scanner according to the application requirements. 

Angular tracking resolution of 20 arcsec RMS at 150 
ft, for afield of view of 30° was measured (1/15000). 
A more complete study of the calibration parameters 
and of temperature variations on the laser scanner is 
still required to fully characterize the exact 
performance of the sensor. To obtain an absolute 
accuracy measure for the full working volume (160 
00 yd^), the difficult problem of obtaining a reference 
system more accurate than the laser scanner must be 
solved. Only resolution is quoted for the moment. 
Full calibration of the laser scanner/ASVS system 
combination is currently in progress. 


References 

[1] S.G. MacLean, M. Rioux, F. Blais, J. Grodski, P. 
Milgram, H.F.L. Pinkney, B.A. Aikenhead, "Vision 
System Development in a Space Simulation 
Laboratory", in Close-Range Photogrammetry Meets 
Machine Vision, Proc. SPIE 1394, 8-15 (1990). 

[2] H.F.L. Pinkney, "Theory and Development of an 
on-line 30 Hz Video Photogrammery System for 
Real-time 3-Dimensional ConiroV', Proceedings of the 
International Society of Photogrammetry Symposium 
on Photogrammery Industry, Stockholm, 1978. 

[3] F. Blais, M. Rioux, S.G. MacLean, "Intelligent, 
Variable Resolution Laser Scanner for the Space 
Vision System", in Acquisition, Tracking, and 
Pointing V, Proc. SPIE 1482, 473-479 (1991). 

[4] F. Blais, J.-A. Beraldin, M. Rioux, R.A. 
Couvillon, S.G. MacLean, "Development of a Real- 
time Tracking Laser Range Scanner for Space 
Applications", Proceedings Workshop on Computer 
Vision for Space Applications, Antibes, France, 1993. 

[5] M. Rioux, "Laser Range Finder Based on 
Synchronized Scanners", AppL Opt., 23(21), 3837- 
3844 (1984). 


[6] J.-A. Beraldin, M. Rioux, F. Blais, G. Godin, R. 
Baribeau, "Model-based Calibration of a Range 
C 2 iir\era'\ Proceedings lA PR,The Hague, Netherlands, 
1992. 

[7] J.A. Beraldin, S.F. El-Hakin, L. Cournoyer, 
"Pratical Range Camera Calibration", in Videometrics 
II, Proc. SPIE 2067, 21-31 (1993). 

[8] F, Blais, M. Rioux, J.A. Beraldin, "Practical 
Considerations for a Design of a High Precision 3-D 
Laser Scanner System", in Optomechanical and 
Electro-Optical Design of Industrial Systems, Proc. 
SPIE 959, 225-246 (1988). 

[9] S.J. Orfanadis, "Optimum Signal Processing: An 
Introduction", Macmillan Publishing Company, 2nd 
Edition, 1985. 

[10] Liu Y., Laurendeau D., "Processing of a Multi- 
scale Triangulated Surface Model of a 3-D Scene for 
a Robotics Application", in Geometric Methods in 
Computer Vision II, Proc. SPIE 2031, 392-403 
(1993). 


472 


AIAA-94-1242-CP 

INTEGRATION FOR NAVIGATION 
ON THE UMASS MOBILE PERCEPTION LAB 


Bruce Draper**, Claude Fennema^, Benny Rochwerger'*' 
Edward Riseman*> Allen Hanson* 


Abstract 

The Mobile Perception Laboratory (MPL) is an 
autonomous vehicle designed for testing visually 
guided behaviors, such as road following, passive 
obstacle detection, landmark-based navigation and 
model acquisition [Hanson, Riseman, & Weems 
93]. The research is being conducted under the 
sponsorship of the ARPA Unmanned Ground 
Vehicle (UGV) Program in the Computer Vision 
Laboratory at the University of Massachusetts. 

The focus of this paper is on integrating multiple 
behaviors into a single, coherent system. It 
presents the ISR3, a new tool for interprocess 
communication, the storage and retrieval of 
transient, image-based data, and the long-term 
management of 3D data. It also presents the script 
monitor, a process control mechanism for 
invoking, monitoring and destroying concurrent 
sets of visual behaviors in response to dynamic 
events and the systems stated goals. 

1. The Mobile Perception Lab (MPL) 

The experimental laboratory vehicle for this effort 
is the UMass Mobile Perception Lab (MPL), a 
heavily modified Army HMMWV ambulance 
(Figure 1) that is equipped with actuators and 
encoders for the throttle, steering and brake. The 
interface to the on-board computer system is 
through a 68030-based controller board. Electrical 
power is provided by an on-board lOkW diesel 
generator feeding uninteruptable power supplies 
and conditioners. The MPL closely matches 
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CMU’s NavLab II, with modifications and 
component installation performed by RedZone, 
Inc., a Pittsburgh-based firm specializing in 
custom robotics. 



Figure 1. The Mobile Perception Laboratory 


The vehicle's sensor package includes a Staget, 
which is a stabilized platform capable of rotating a 

full 360^. The Staget is mounted at the center of 
the cab roof and contains a CCD color camera and 
a FLIR sensor in a weatherproof enclosure. Two 
forward-looking stereo cameras and a forward 
looking color CCD camera are mounted in a 
rectangular enclosure at the front edge of the cab's 
roof. The primary computing engine for vision 
processing, goal-oriented reasoning and path 
planning is a Silicon Graphics 340GX four-node 
multiprocessor. The multiprocessor is interfaced 
to the sensor suite through a Datacube 
MaxVideo20 processor, which provides frame rate 
image processing for certain types of operators. 
Space and power has been provided for the 
possible addition in the future of a 16K Image 
Understanding Architecture (lUA) [Weems 1993], 
a massively parallel heterogeneous processor. 
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The physical lay-out of equipment on the vehicle 
is depicted in Figure 2. The first programmer 
station is located in the HMMWV's passenger seat, 
with a 17" color X-terminal fixed to the metal 
platform between the passenger’s and driver’s 
seats. The second programmer station is located 
behind and slightly above the driver, and includes 
a car seat, mounting brackets for both an SGI color 
terminal and a small SONY monitor for viewing 


those modules. At the same time, MPL’s software 
environment must be efficient enough to meet the 
demands of real-time navigation research. 

'The need to balance between flexibility and 
efficiency has led us to design a software 
environment around the ISR3, an in-memory 
database that allows users to define structures for 
storing visual data, such as images, lines and 
surfaces [Draper 93a,b]. 1SR3 serves as a process 
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Figure 2. Interior layout of the MPL. 


raw TV signals. Behind this programmer station is 
the diesel generator and power conditioning 
systems. The passenger’s side rear hold four 
enclosed, air conditioned 19" computer frames for 
the on-board computer systems. The first frame 
holds the vehicle controller, image digitizers and 
frame stores, the MaxVideo20, and the Staget 
controller and interface. The second computing 
frame contains the Silicon Graphics 
multiprocessor, disk drives, power supply and 
(removable) tape drive. The third frame is 
reserved for the Image Understanding Architecture 
(lUA). The fourth frame contains video recorders 
for collecting experimental data. 

2. Integration of Vision Modules for Real- 
Time Contro l: An Overview 

MPL is an experimental laboratory for testing and 
integrating different approaches to problems in 
autonomous navigation, including, but not limited 
to, landmark-based navigation, obstacle detection 
and avoidance, model acquisition and extension , 
road following, and path planning. It is therefore 
important that MPL have a software environment 
where multiple visual modules, addressing 
different subtasks, can be easily integrated, and 
where researchers can quickly experiment with 
different combinations and parameterizations of 


communication interface, so that, for example, 
lines produced by one module can be used by 
another, even if the second module is run later or 
on a different processor than the first. ISR3 also 
provides modules with efficient spatial access 
routines for visual data, and protects data from 
being simultaneously modified by two or more 
concurrent processes. A graphical programming 
interface allows programmers to easily sequence 
modules and modify their parameters. 

Our work on planning has concentrated on plan 
execution rather than plan generation; thus, we 
assume that plans are developed by the user (by 
hand) or by an appropriate planning subsystem. 
With this goal in mind, a control system, called the 
Script Monitor, has been implemented to support 
real-time execution of plans [Rochwerger et. al. 

94] . The task of autonomous navigation, as with 
many other complex tasks, can be decomposed 
into specific subtasks like road following, obstacle 
avoidance and landmark recognition. In order to 
achieve a fully autonomous capability, the 
solutions to all these subproblems must be 
integrated into a coherent system. This paper 
focuses on the preliminary work done on the 
problem of integration for the UMass Mobile 
Perception Laboratory (MPL). Since each of the 
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subproblems is still a field of active research in 
which approaches and solutions may change 
rapidly over time, the integrated system should be 
flexible enough to allow testing different sub- 
systems and different control strategies. 

To achieve the desired flexibility we have 
implemented the control system as a 
“programmable” finite state machine (FSM), in 
which the states represent the different modes of 
operation of the system (behaviors) and the state 
transitions correspond to the system's reactions to 
events (either external or internal). The 
composition of the states and the transitions is not 
fixed, i.e., the FSM can be tailored to the 
particular task the system is trying to achieve. 

3. Integation via the ISR Database. 

MPL’s database is built around 1SR3, an in- 
memory database for computer vision. 
Historically, ISR (an acronym for intermediate 
symbolic representation) has been the name of a 
series of symbolic databases for vision developed 
at the University of Massachusetts [Brolio et. al. 
89]. One version, ISR 1.5, is now commercially 
available as part of KBVision™ [Williams 90], 
while the most recent version, 1SR3, is used on- 
board the MPL. The ISR databases reflect a belief 
that computer vision requires more than image- 
like arrays of numerical data; computer vision 
depends on symbolic representations of abstract 
image events such as regions, lines, and surfaces, 
and on mechanisms for efficiently accessing data 
objects by a model-based system under various 
types of constraints (such as spatial proximity). 
Although each version of ISR has been a 
refinement of its predecessor, they all assume that 
visual procedures operate on symbolic records, 
called tokens, or on groups of tokens, and that 
visual procedures manipulate tokens both for 
internal computations and for exchanging data 
with other procedures. 

1SR3 is an in-memory database for C structures. It 
reads a C header file of structure definitions when 
it is first initialized, and records the size of the 
structures (called tokens) and the data types and 
offsets of their fields. ISR3 then establishes a 
database for these structures, establishing queues 
for allocating and freeing tokens, semaphores for 
preventing simultaneous access, and functions for 
storing and retrieving tokens. Once one process 
has initialized ISR3, other processes can attach to 
it (since aU tokens are kept in shared memory). 


making ISR3 a communication mechanism as well 
as a data store. 

2.1-ALYINPi 

In order to emphasize ISR3’s role as an integration 
mechanism, its communication, synchronization, 
and data storage capabilities will be described in 
the context of specific MPL tasks. One of MPL’s 
most basic tasks is to drive down roads, using the 
ALVINN neural-net road-following system 
[Pomerleau 90, 92]. Developed at Camegie- 
Mellon University, ALVINN is a neural network 
with a single hidden layer that produces steering 
vectors from reduced (32x30) and color- 
compensated intensity images ^ When run alone it 
implements a simple road following behavior by 
grabbing images from a forward-looking camera 
and sending commands to the (low-level) vehicle 
controller to correct for drift and turns in the road 
in order to keep the vehicle on course. 

On the MPL ALVINN is almost never run in 
isolation, however. At the very least it is run with 
an obstacle detection program to ensure the safety 
of the vehicle (otherwise ALVINN is more than 
happy to run into obstacles). ALVINN is 
integrated with the obstacle detection program by 
interposing an arbiter between ALVINN and the 
vehicle controller. The arbiter takes the steering 
vectors produced by ALVINN and combines them 
with the results of obstacle detection to make sure 
the vehicle avoids obstructions. 

ALVINN is integrated with the rest of MPL's 
software by declaring its steering vectors to be 
ISR3 tokens stored in shared memory. ALVINN 
can then be tested in isolation from other systems 
by having the controller read ALVINN's steering 
tokens directly, and combined with other systems 
by having the arbiter retrieve ALVINN’s tokens 
and produce its own (modified) steering tokens for 
the controller. A similar principle can be applied 
to image acquisition, with a data server producing 
image tokens and storing them in shared memory 
for ALVINN (or any other process) to retrieve. 

3.2 Landmark Recogni tion and Positioning 
Although ALVINN demonstrates how ISR3 can 
help integrate software modules, including 
modules developed at other sites, it does not 
exercise all of the ISR3 capabilities. MPL's 


* Properly speaking, an in-memory database is called a 
data store. 
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landmark recognition system is a combination of 
four software modules that together match 3D 
landmark (or object) models to images and 
determine the position and orientation of the 
camera relative to the landmark. The first module 
uses color and texture information to limit the 
search for the landmark to a specific region of 
interest (ROI), the second extracts straight (2D) 
line segments from the ROI, the third projects the 
3D landmark model according to the estimated 
viewing point and determines which (3D) lines 
should be visible, and the fourth matches (2D) 
image lines to visible (3D) model lines and 
determines the relative position of the vehicle to 
the landmarks in the world coordinate system. 

As with ALVINN and the arbiter, ISR3 is the 
communication mechanism that allows the four 
landmark recognition modules to exchange data, 
this time by declaring ROls, (2D) image lines, 
(3D) model lines and (3D) faces to be ISR3 
tokens. The landmark recognition modules are 
more typical of vision applications than ALVINN, 
however, in that they produce large numbers of 
tokens which are typically accessed by name, 
feature value or spati^ location. Spatial access is 
particularly important for the matching modules, 
which must repeatedly access image lines near the 
projections of model lines (according to the 
current pose estimate). 

Most landmark recognition tokens, particularly 
ROIs and image lines, exist for only a short period 
of time (in this case the time required to process 
one image) and should then be deallocated. 
Consequently, file I/O is not optimized: although 
data is sometimes saved for later analysis in the 
lab, most data can be kept in memory and then 
erased without ever being saved to disk (ignoring 
the effects of paged virtual memories). Token 
allocation and deallocation, on the other hand, are 
critical, which is why ISR3 maintains its own 
token queues. 

The landmark recognition processes are also 
typical of many vision application programs in that 
they both produce and consume sets of tokens 
rather than single tokens. The matching module, 
for example, finds (potentially many-to-many) 
correspondences between sets of image lines and 
sets of model lines. Therefore most storage and 
retrieval commands in ISR3 are in the form of set 
operations, such as a request to access “all long, 
straight lines in the upper comer of an image”. 


Synchronization is provided, not at the level of 
individual tokens, but of sets of tokens, so that 
processes may iterate over sets of tokens without 
having to lock and unlock each token individually. 
Special facilities for optimizing spatial retrieval 
over individual or arbitrary sets of tokens are also 
provided, as are macros for iterating over the 
tokens in a set, and functions for taking the union, 
intersection and differences of sets^. 

Although most tokens are temporary, ISR3 also 
provides permanent storage for those few sets of 
tokens (critical features, updated maps, etc.) that 
correspond to significant results and should persist 
over time. Model acquisition processes, such as 
one described by Sawhney [Sawhney 93] produce 
3D models of the type used for object recognition 
(i.e. 3D points, lines and faces). These models, 
once learned, should be permanently stored, and 
the landmark recognition processes should always 
have access to the most recent version of any 
model. With regard to these tokens, therefore, 
ISR3 serves as a permanent data base system that 
provides storage for, and stmetured access to, 
long-term data. 

3.3. ISR3 Protection and Memory 

Management Mechanisms 
Although ISR3 acts as a general database system, 
it has been optimized for real-time vision research 
by reducing overhead wherever possible while still 
supporting those functions most often used in 
computer vision. For example, one task of a 
database in a multiprocess environment is to stop 
processes from accidentally overwriting or 
destroying each other's data. In a typical computer 
vision application, hundreds or thousands of 
tokens may be in memory at one time. If multiple 
processes have uncontrolled access to these tokens 
and modify them, unpredictable interactions will 
cause elusive and non-repeatable bugs. On the 
other hand, it is not uncommon for a process such 
as the model matcher to access hundreds of tokens 
at a time. If it had to lock and unlock a token each 
time it reads a feature value, protection would 
become unacceptably expensive, especially given 
the relatively slow speed of semaphores under 
UNIX. 

A compromise used in ISR3, therefore, was to 
associate semaphores with sets of tokens. When a 


^The complement of a set is not well defined, since 
there is an infinite universe of possible tokens. 
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set of related tokens is created, for example the set 
of lines extracted from a ROl, a semaphore is 
allocated to protect those tokens. Any 1SR3 
function that accesses that set of tokens will first 
check the semaphore; ISR3 access functions that 
create subsets of a defined set of tokens assign the 
same semaphore to the subset that is used for the 
parent set. If users access tokens surreptitiously 
through C pointers^, they are expected to lock the 
semaphore before accessing the first token and to 
unlock it after the last token access. As a result, as 
long as users do not circumvent its safeguards, 
ISR3 is able to provide process synchronization 
with very little overhead. 

Similarly, ISR3 provides a level of memory 
management on top of the UNIX operating 
system. For non-real-time, file-based systems, 
memory management is not a critical issue; for 
continuous, real-time systems, however, it is 
crucial. ISR3 applications operate in real-time 
loops, allocating new tokens on each iteration. 
Memory allocation must be rapid, and must avoid 
fragmenting memory (as repeated calls to ntalloc 
would). Memory must be recycled, with space 
aUocated to old tokens being reassigned to new 
ones once the old data is no longer needed. ISR3 
satisfies these requirements by providing token 
buffers (at a level hidden from the user). Calls to 
create a token actually allocate one from the buffer 
for that token type, and freed tokens are returned 
to the appropriate buffer. For users who store 
tokens in hierarchies, ISR3 also provides functions 
for tracing through a hierarchy and freeing all the 
tokens in it, so that, for example, one can free all 
the memory associated with an image once the 
image is no longer current. 

4. Executing Reactive Behaviours: Scripts 
and the Script Monitor 

The term behavior has generally been used in the 
literature to describe processes that connect 
perception to action, i.e., a behavior senses the 
environment and takes an appropriate action based 
on what was perceived. A combination of 
behaviors is also called a behavior; thus, a 
complex behavior can be achieved by combining 
simpler behaviors. In Brooks' subsumption 
arcWtecture [Brooks 86], the task of robot control 


^Since ISR3 stores arbitrary C Snuctures, there is no 
way to prevent users from accessing tokens without 
using ISR3 functions, once the user has obtained one 
pointer into the database. 


is decomposed into levels of competence; each 
level, in combination with lower levels, defines a 
behavior. In their Distributed Architecture for 
Mobile Navigation system (DAMN), Payton, 
Rosenblatt and Keirsey [Payton 86; Payton et. al. 
87] refer to behaviors as very low level decision- 
making processes which are guided by high level 
plans and combined through arbitration. In their 
DEDS work, Ramadge and Wonham [Ramadge 
89], as well as Rivlin [Rivlin], events are 

considered the alphabet, £, of a formal language. 
A behavior is then a sequence of events, or a string 
over L*. Note that in this terminology every 
prefix of a string is also a behavior, i.e., the 
sequential combination of behaviors is a behavior. 

These definitions, although consistent, can be 
confusing - the same term is used for individual 
processes and for the composition of these 
processes. We have chosen to think of a behavior 
as a mode of operation [Payton et. al. 90], in which 
several perception-action processes [Draper 94] 
are executed concurrently. Each process converts 
sensory data into some kind of action (either 
physical or cognitive), and at any time may 
generate an event - a signal to let the system know 
that “something” significant has occurred. All 
inter-process communication is achieved through a 
global blackboard -a section of shared memory 
accessible to aU processes. The blackboard is built 
on top of the ISR3 which, as described earlier, 
provides a very efficient memory management 
mechanism (on top of UNIX) and a set of 
primitives necessary for shared memory based 
communication. 

±L A Finite State Machine Representation 

for Behaviors 

An autonomous system must react to events by 
changing its behavior; hence the sequence of 
behaviors actually executed depends upon the 
sequence of events. Since the latter is 
unpredictable, so is the former. To program such a 
system, one must specify which processes 
constitute a behavior and for each possible event 
describe the system's reaction. In order to specify 
such a system, a finite state machine (FSM) 
formalism has been chosen, in which the states 
represent behaviors and the transitions reactions to 
events. 

As a simple illustrative example, the following set 
of statements describe a system that will drive on 
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the toad while obeying traffic lights, until a given 
distance is traveled: 

While driving dawn the road 

the treble light turns red, wait, 
if goal is reached, stop. 

While waiting at the trcffic light, 
it turns green, start driving. 

These statements correspond to the simple FSM in 
Figure 3a; note that a state represents a behavior or 
mode of operation, i.e, a set of concurrent 
perception-action processes. This example can be 
implemented with four perception-action 
processes: follow the road (rfi, check for traffic 
lights (tl), monitor the distance traveled (dm), and 
stop the vehicle (vs). Listed below the state name 
are the perception-action processes that should be 
mn and killed (marked with a ~) in that state. 

Based on the notion of behaviors represented as 
states of a finite state machine, a Behavior 
Description Language (BDL) was designed and 
implemented. Behaviors are described as two sets 
of perception-action processes, and a transition 
table. The run set specifies the minimum set of 
processes that form the behavior; the kill set 
specifies those processes that should not be 
miming for the correct execution of the behavior. 
The choice of two sets implies that processes that 
were running when the behavior started will 
continue to mn unless explicitly killed. The 
transition table specifies what to do for each of the 
valid events (events not specified in the state 
description are not valid). The representation of 
our simple example expressed in the BDL is 
shown in Figure 3b. First, the perception-action 
processes available are listed. Then the set of 
states (or behaviors) and the set of events are 
declared. Finally, a description of each state is 
provided; parameters required for the perception- 
action processes associated with each state are 
fetched from the blackboard prior to their 
initiation. For a more complete example, see 
Section 4.4. 



PROCS = { 

rf "DrlveOnRoad” 

tl ”CheckTralTlcLlghr 

vs "VehIcIcStop” 

dm "DistanceMonItor'*} 

STATES = {drive, wait, stop} 

EVENTS = (red, green, done) 

WHILE drlve(d)( 

SET distance = d; 

RUN rf, tl, dm; 

EVENT red GOTO wait; 

EVENT done GOTO stop;} 

WHILE wait 0 ( 

KILL rf; 

RUN vs,tl; 

EVENT green GOTO drive; 

(b) 

Figure 3. Script of a simple driving 
system, (a) Finite state machine (FSM) 
representation, (b) Behaviour Description 
Language (BDL) representation. 

4.2. Scripts - Augmented finite state 
machines 

The simple FSM model discussed in the previous 
section has been augmented in two ways. First, a 
fetch-goal state was added to the set of states as a 
mechanism for compacting the representation. 
The system starts in the fetch-goal state where it 
reads goals (in terms of behaviours) from a 
precompiled plan. When a goal is retrieved, the 
script monitor writes relevant blackboard 
messages into the blackboard before creating the 
perception-action processes associated with the 
state. Once a perception-process is mnning, it 
looks for its parameters in the blackboard, which is 
implemented within the ISR structure described 
earlier. The fetch-goal state differs from all other 
states in two ways: (1) transitions out of the state 
are unlabelled since the next state is explicitly 
specified in the goal retrieved from the script; (2) 
Unless the plan is empty, the kill and run sets are 
ignored. 


478 


With these change, a script 5 is formally defined 
as the eight-tuple (P,Q^M,S,K,p,G), where: 

• P is the set of available perception-action 

processes. 

• C is the set of states (Q = B u fetch-goal, 

where B is the set of behaviors). 

• £ is the set of possible discrete events 

(transitions in the FSM). 

• Af is the set of valid blackboard messages. 

• 5is the transition table, d.BxE-^ Q. 

• vis the kill table, k: BxP [0, ]}. 

• p is the run table, p : fl X F -» {0,7}. 

• G is the plan expressed in terms of subgoals. 

Each subgoal is of the fonn <bg,Mg>, foxbg 

e B and Mg c M. 

Scripts can be generated by an automated planner, 
or by hand (using BDL). 

The Script Monitor 

The script monitor is in charge of “high level” 
control^: reading, interpreting, and executing BDL 
scripts. Essentially, the script monitor is a plan 
execution system [Georgeff 90] similar to PRS 
[Ingrand et. al. 92; Lee et. al. 93] in some aspects. 
The monitor does not perform any direct action on 
the vehicle controller by itself; rather, it controls 
the set of running processes which dfi take direct 
action. 

The script monitor consists of two modules, an 
interpreter and an execution system. The 
interpreter takes a BDL script S and builds the 

transition (5), kill ( x) and run (p) tables. After 
this, the subgoals in S's plan are stacked into the 
execution stack G. The execution system simulates 
a finite state machine as shown in Table 1. 

Gearly, for the system to complete the task in G, 
the following must hold: 

VAe Q 3s lye E* s.t. 5\b,sij)= fetch- goal 

where 5' is the transition function applied to a 
sequence of events [Hopcroft and Ullman 79]. 

^ In this context, "high level" control is used to 
differentiate the control of processes from the “low 
level” control of the vehicle actuators. 


Id. A More Complete Example and an 

Experiment 

A set of perception-action processes have been 
implemented for the MPL. TTiese include: 

• Vehicle pose determination based on 

landmark model matching [Beveridge 93; 
Draper 93b; Kumar 92,93]. 

• Neural-network road following (ALVINN) 

[Pomerieau 90]. 

• Servo-based steering [Feiuiema and Hanson 

90; Fennema91]. 

• Obstacle detection via stereo [Badal et. al. 

94]. 

• Reflexive obstacle avoidance [Ravela et. al. 

94]. 

• A distance monitor. 

• Turning via dead reckoning. 

• Servo to compass heading. 

• Harmonic function path planner [Connolly et. 

al. 92,93] 

In the future, additional processes will be added, 
including landmark tracking, recognition and 
modeling of natural landmarks, automatic 
landmark extension, etc. 

Using a subset of these processes, an experiment 
was designed in order to demonstrate the 
capabilities of the vehicle and the performance of 
the independent perception-action processes. The 
following script was successfully tested on the 
vehicle at the UMass test site: 

(1) Drive on the road, while avoiding 
obstacles, for x meters. 

(2) Estimate vehicle position using 
landmarks. 

(3) Drive on the road, while avoiding 
obstacles, for y meters. 

(4) Estimate vehicle position using 
landmarks. 

(5) Turn left (at the experimental site this 

command is a transition to off-road 
navigation). 

(6) Drive off road (by servoing on a 
compass heading), while avoiding 
obstacles, for z meters. 
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1. bg-^ fetch-goal (Start at the fetching state) 


2. if (bg = fetch-goal) then 


(a) if G is empty then '^pe P 


i. kill (p) 

(Terminate ALL processes) 

ii. if (p(bg, p) = 1) then mn(p) 

(Run cleanup processes) 

iii. stop 

(S was successfully executed) 

(b) <bg, Mg> <- pop(G) 

(Fetch next goal) 

(c) blackboard 

(Write blackboard messages) 

3. Vpe Fif x(6^,p) = 7)thenkill(p) 

Terminate processes) 

4. Vpe Fif (p(6g,p) = 7) then run(p) 

(Create processes) 

5. Wait for an event ee E 

(Wait) 

6. bo i — d(bg,e) 

(React to event - follow transition table) 

7. goto 2 

(Repeat until all goals are achieved) 


Table 1. Script Monitor Execution Subsystem 


The perception-action processes involved in this 
example include pose estimation (pe), road 
following (rf), obstacle detection (pd), obstacle 
avoidance (oa), servoing to a compass heading 
(se), distance monitor (dm) and dead reckoning 
turning (dt). The full BDL script for this 


coordinated action, with x = 100, y 
50, is: 

PROCS = 

! 

i 

pe 

"PoseEstimate" 

rf 

"RoadFollow" 

od 

"ObstacleDetect" 

oa 

"Obstacle Avoid" 

se 

"Servo" 

dm 

"DistanceMonitor" 

dt 

"DeadReckoningTum" 

vs 

"VehicleStop") 


STATES={ drive-onroad, drive-offroad, turn, 
compute -pose, avoid-obstacles} 

EVENTS = {success, obstacles, clear) 

WHILE drive-onroad (dist) { 

SET distance = dist; 

RUN rf, od, dm; 

EVENT success GOTO compute-pose; 
EVENT obstacle GOTO avoid-obstacles; } 

WHILE drive-offroad (dist) { 

SET distance = dist; 

RUN se, od, dm; 

EVENT success GOTO compute-pose; 
EVENT obstacle GOTO avoid-obstacles;} 


WHILE turn (dir, dist) { 

SET direction = dir, 

SET distance = dist; 

RUN dt, dm; 

EVENT success GOTO fetch;} 

WHILE avoid-obstacles ( ) { 

KILL rf, se; 

RUN 02 od’ 

EVENT clear GOTO BACK; } 

WHILE compute-pose ( ) { 

KILL rf, se; 

RUNpe; 

EVENT success GOTO fetch; } 

GOALS { 

drive-onroad (100); 
drive-onroad (150); 
turn (left, 10); 
drive-offroad (50);) 

The augmented finite state machine for this script 
is shown in Figure 4. Note that the figure shows 
the addition of the fetch-goal state discussed 
earlier. The FSM also shows a potential link to the 
high level planning system (not yet implemented) 
which is activated (in this example) by failure of 
the pose estimation state to localize the vehicle. In 
this case, the vehicle is considered to be lost - it 
then stops and initiates planning to resolve its 
location. Since planning may result in 
modifications of the goal stack, the addition of the 
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planning state^ may theoretically change the 
model to a push-down automata (PDA). 



navigation. 

5. Summary and Conclusions 

One of the requirements for MPL’s software 
environment was that it should support the 
integration of real-time visual procedures with as 
little overhead as possible. This meant that the 
focus of the system design had to be on the two 
critical areas of data storage/exchange and process 
control. Data storage and exchange is supported by 
the ISR3 real-time database/datastore system, 
which provides a central data repository and 
communication mechanism for all the perception- 
action processes running on the MPL. Behaviors 
are collections of concurrent perception-action 
processes whose interaction and execution are 
controlled through a script; MPL’s script monitor 
is a low-overhead control system that switches 
from one behavior to the next in response to 
external conditions. 

Although limited, the script monitor system in its 
current form has given us an idea of the 
complexity involved in building intelligent 
controllers, particularly in a real-time application 
domain where safety must be ensured. Encoding 
system reactions as a finite state machine seems a 
reasonable approach, but it is not clear how to 
optimally construct the individual states, i.e. which 
perception-action processes constitute each state, 
and how these processes interact with one another. 


50r any state that can write to the goal stack. 


In the current implementation, all communication 
between processes is done through the blackboard. 
In this particular domain, where potentially large 
amounts of data are shared (images, maps, etc.), 
the shared memory paradigm seems the most 
efficient method of communication for processes 
mnning on the same machine. But if perception- 
action processes were to run in a distributed 
architecture, other means of communication will 
be necessary (UNIX sockets or a system such as 
TCX [Fedor 93]). In the current implementation, 
it was assumed that scarcity of resources was not 
an issue. However, independently executing 
concurrent processes which access sensors and 
send commands to actuators, or otherwise affect 
other scarce system resources, will inevitably 
cause problems if resource allocation is not 
handled correctly. 

Resource scheduling and sharing, inter-process 
communication and real-time control are difficult 
problems, particularly in a real-time dynamic 
environment, and efficient solutions to them are 
essential if robust autonomous systems arc to be 
constructed. 
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Abstract 

Based on experience with real-time image sequence 
processing systems in the application areas of vehicle dock- 
ing, road vehicle guidance, AGV’s on the factory floor, 
aircraft landing approaches and of dynamical grasping of 
free floating objects in space with remote control from the 
ground including long delay times, an efficient, distributed, 
expectation- as well as object-based general dynamic vision 
system architecture has been developed. Parallelization is 
stmctured according to physical objects, the characteristics 
of which with respect to 3-D shape, motion behavior, visual 
appearance and all other significant properties in the task 
context are represented internally in generic form. Both 
differential representations for state estimation as well as 
behavior control, and integral ones for mission plarming, 
mission control and monitoring are being used. References 
to detailed reports on all application areas mentioned are 
given. 


Introduction 

The sense of vision is the predominant source of infor- 
mation for intelligent motion control in biological systems; 
why has it been missing in technical systems almost entirely 
until very recently? There are at least two basic reasons: 
First, human visual capabilities are well developed, and 
similar real-time performance on the technical side requires 
computing capabihties not nearly available until about a 
decade ago; the data flow in a color video signal is of the 
order of magnitude 10^ Bytes per second (10 MB/s) while 
clock rates of computers are between 10 and 100 MHz. 
Assuming 10 to 100 operations per data point (or ’picture 
element’) in the image (this will be abbreviated in the sequel 
as ’pel’ or ’pixel’) it is immediately seen that many parallel 
processors are needed for real-time performance just for the 
image sequence processing part, let alone dynamic scene 
understanding, control computation and mission monitor- 
ing. 

The second reason is, that -unlike in biological sy stems- 
digital image processing started from static single image 
evaluations as in remote sensing applications. Until the early 
80ies, when researchers from the field of control engineer- 
ing moved into this newly developing field of image se- 
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quence evaluation, the approach to this field has been domi- 
nated by ’quasi-static’ thinking; time has been introduced 
through the backdoor by differencing between images and 
starting from so-called ’optical flow’ informatioa 

However, from linear systems theory in the field of 
najectory reconstruction based on noise cormpted measure- 
ments, recursive estimation techniques have been known 
since the early 60ies^ which allow to substitute knowledge 
about tlie real-world processes to be observed for missing 
or poor-quality data. These so-called ’dynamical models’ 
represent temporal dependencies explicitly as side con- 
straints for data interpretation during process evolution over 
time. Exploiting these constraints systematically in conjunc- 
tion with spatial shape characteristics resulted in increased 
image sequence processing efficiency by orders of magni- 
tude^. 

This is due to the fact that temporal predictions using the 
dynamical models allow to control both assignments of 
image regions to parallel processors and the extractions of 
features by special algorithms in limited search regions 
depending on the situation encountered; all of this is geared 
to objects of specific classes for which corresponding 
generic knowledge is represented in ’object processor 
groups*. This leads to efficient local communication stme- 
tures and to a modular system design^. 

At UniBwM this approach has been applied to half a 
dozen different guidance and control tasks. It became well 
known in the field of visual guidance for autonomous land 
vehicles where surprising performance levels have been 
acliieved witli very moderate computing power. Over the- 
last several years the approach has enjoyed increasingly 
widespread use worldwide; there are many variants do- 
cumented in the literature and their number is increasing 
rapidly 

In this paper, a survey is given on the general method as 
it presently stands at UniBwM. The following section co- 
vers tlie method proper featuring the basic ideas like 4-D 
representation, orientation towards physical objects, expec- 
tations and prediction error feedback as well as the central 
role the Jacobian matrices play for the relationship between 
image features and object state components; with increasing 
numbers of sensors and of objects in the scene analysed, the 
management scheme of the perception system had to 


483 



become capable of dealing with occlusions, partial observa- linearizations of all nonlinear relationships are sufficiently 

bility due to aspect conditions and witli model switching on good; only the last image of the sequence needs be worked 

top of the basic distinction between initialisation and track* with, tliercby avoiding storage and retrieval problems with 

ing phases . previous ones. This enormously alleviates the data handling 

The applications to mobile robots discussed in a survey problem; no optical flow needs be computed. However, the 

fashion, following this display of the metliod, are: rendez- spatial velocity components are obtained by smoothing 

vous and docking (phmar reaction controlled satellite dock- numerical operations. 

ing, and spatial autonomous landing approaches of aircraft). This prediction error feedback scheme for each object 

surface vehicle guidance with local reactions (road runner leads to a servo-maintained internal representation dupheat- 

including obstacle avoidance or ’intelligent cruise control’) ing all essential aspects of tlie real-world subprocesses being 

and global mission control (landmark navigation both on the individually observed and analy sed. Tlie integral interpreta- 

factory floor and on an outside road network). tion in 3-D space and time has led to the name ’4-D ap- 

The dynamical grasping experiment during Spacelab proach’. Figure 1 shows the resulting block diagram for a 

mission D2 in May 1993 concludes this application survey; single imaging sensor and multiple objects besides the own 

with sensors and the robot arm as effector onboard the Space veliicle to be controlled. 

Shuttle Columbia, and with tlie computers on tlie ground tliis In parallel to the real world (shown in tlie upper left 

teleoperated (’remotely brained’) system lias achieved the block) with motion processes happening in 3-D space and 

first capture of a free-floating object in space by delayed time, an internal representation, also in 3-D space and time 

visual feedback (5.8 seconds !). but limited to the most essential aspects for die actual task 

at hand, is built up in the interpretation process by prediction 
error feedback (’internally represented world’ in upper right 
The 4-D approach block of fig.l). There is a fundamental difference between 

the initialisation pliase when a new object is being dis- 
The 4D approach to dynamic machine vision exploits covered, and the tracking phase when temporal continuity 

dynamical models in the form of state transition and control conditions yield a good guideline for understanding the 

effect matrices for sampled data systems with cycle times evolution of the dynamical scene observed. The basic ideas 

as multiples of the basic video cycle time (20 ms in Europe, will be discussed in turn. 

16 2/3 ms in the USA); the recursive 
state estimation methods well 


known in systems dynamics for 

smoothing botli process and mcas- Real worid (1) 

urement noise are combined here Processes in space” 

with 3-D shape representations of and time 
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least squares Kalman filter algo- 
rithm for noise reduction indirectly 
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ficiently accurate approximate man- 
ner, recovering the third dimension 
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25 Hz for image sequence interpretation, the underlying 
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Fig. 1: Basic prediction error feedback scheme for single object 
and multiple imaging sensor 

Basic ideas 


Five essential elements constitute the base of the ap- 
proach during continuous observation of a moving object 
(tracking phase): 
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Edge element features: The most basic element to effi- 
cient image sequence processing in a steadily changing 
environment is to fully exploit the difference between data 
and information. A uniformly grey image contains the same 
amount of data as a page with text and pictures from com- 
plex scenes; however, in the former case a human observer 
completely describes the information content by two words: 
’uniformly grey’, while for the latter one he may have to talk 
for several minutes in order to convey at least the most 
essential aspects. If the image would contain two differently 
colored areas with a curved boundary, the most economical 
way of capturing the information content in a symbolic 
description would be to formulate the boundary between the 
colored regions by the geometry of a line (straight segments 
or curves with given curvature along the arc length) and by 
specifying the homogeneous areas by two color symbols; 
again, the information exhaustively describing the image 
can be coded in orders of magnitude less data as compared 
to the pixels involved. Using tangent direction information 
and points of discontinuity (comers) seems to be an efficient 
coding scheme for boundaries in an image. Supposedly, this 
is the reason behind nature having developed the capability 
of doing just this in some of the high performance biological 
vision systems (striate cortex in VI). 

Looking for dark-to-bright transitions (or vice-versa) 
irrespective of the absolute brightness level or spectral in- 
formation content makes these systems less dependent on 
threshold values and thus more robust. Confining these 
tangent direction measurements to closely spaced discrete 
points in the image plane yields a natural discretisation 
allowing to constmct both smooth curves from assumed 
linear changes of curvature along arc length in a differential 
geometiy interpretation (corresponding to third order poly- 
nomials in Cartesian space over not too large arc lengths) 
and sharp corners when tangent directions are far apart even 
though their centers are closely spaced^"^ . By these tangents 
(edge elements) any shape can be represented by corre- 
sponding feature groupings; in a multiple scale concept, 
mask operators for feature extraction can detect daik-to- 
bright transitions on several scales thereby allowing object 
detection and characterization with different resolutions; for 
example, for many practical purposes in vehicle guidance it 
is sufficient to characterize obstacles by the encasing rec- 
tangle or box. In bifocal vision with different focal lengths 
evaluated simultaneously, this gives an easy choice for 
either fast tracking or more precise shape determination 

Therefore, the family of ternary edge feature extractors 
as shown in fig. 2 is used as the predominant image process- 
ing tool in the context of the 4-D interpretation scheme; it 
has been developed over a decade of work in real-time 
dynamic scene understanding^ The mask parameters 
are dynantically controlled by the object recognition process 
taking the 4-D representation and perspective mapping into 
account, thereby realizing a fast feedback loop from high- 


level interpretation to low-level feature extraction; it is 
especially this feature which makes the tracking phase so 
efficient. 



Fig. 2; Operator family for edge feature extraction 

Representation in 3-D space and time directly: No basic 
representations are performed in the image plane; at the 
earliest time possible it is tried to jump from a feature 
distribution supposed to belong to a single object to an 
object hypothesis in 3-D space and time. Since perspective 
projection is the link between spatial shape, relative orien- 
tation as well as position, and the shape in the 2-D image 
plane, always both object sliape and aspect conditions have 
to be hypothesized in conjunction. Shape invariants of mov- 
ing rigid bodies are in 3-D space and not in the image plane; 
simple motion descriptions also are more easily encoimtered 
in 3-D space than in tlie image plane where both motion and 
shape together y 'vdd relative image feature distribution from 
frame to frame. In addition, motion beliavior in 3-D space 
may be as characteristic for an object as its shape; this leads 
to the third essential element: 

Orientation towards physical objects: Knowledge about 
tlie real world is attached to objects which serve for struc- 
turing complex scenes. Similar properties or shapes lead to 
the definition of object classes characterized by generic 
forms and functions; other attributes may be appended 
depending on the task at hand (e.g. color, texture). For 
subjects, defined as objects with the capability of self-in- 
itiated locomotion^ stereotypical motion characteristics 
may give independent cues to recognition besides static 
shape. In general, tlie centroid of features from an object 
yields information for translational motion, wliile rotational 
motion and shape may be derived from systematic changes 


485 





of feature positions around the centroid, that is from dif- 
ferences between feature positions in the image. 

Once an object and its motion has been recognized, the 
continued observation can be made much more efficient by 
tlie fourth basic element: 

Expectations and prediction error feedback: The dy- 
namical model of a process (e.g. a moving rigid body) may 
be given to first order by the vector difference equation 

x[(^t+l)7] ^A{k) x[kT\ + BQi)u [kT\ + v[/c7], (1) 

where x is tlie state vector of dimension n, k is tlie time index 
for the actual state, T is the cycle time, A is the state 
transition matrix («•«), B the control effect matrix u 
the r-vector of control variables, and v represents process 
noise with covariance matrix Q. Predictions, of course, arc 
made disregarding the noise term. After prediction tlie time 
index k is increased by 1 and from the predicted state x in 
combination with the shape description the features to be 
measured in the next image are obtained from applying the 
forward perspective projection equations; for this purpose, 
first, the spatial positions and orientations of edge elements 
have to be computed by combining position and angular 
orientation of the body-fixed coordinate system having its 
origin at the object center with the shape description in these 
coordinates. 

Then, the perspective mapping equations containing all 
translations and rotations beti\'een the body-fixed and the 
camera coordinate system (in x ) as well as the camera 
parameters p have to be applied to all well visible edge 
features in order to obtain the predicted (horizontal and 
vertical) feature positions in the image: 

/ = /z(xV), (2) 

where dim. (y ) depends on kind (edge or comer) and number 
of features. It is assumed that the error between predicted 
(y ) and actually measured feature positions (y) is so small 
that a linear approximation to eq.(2) captures the essential 
part of the dependencies between y and x: 

del_y ^ dh(x* ,p)/dx*^(x - x*) = Odel_x, (3) 

where C is the Jacobian matrix of all first order partial 
derivatives linking state component changes to feature shifts 
in the image plane. Because of the richness in information 
contained in this matrix and the central role it plays in the 
4-D approach, it will be discussed below as the fiftli basic 
element. 

The actual measurement data y from feature extraction 
will be cormpted by measurement noise w (both from the 
video signal and from image processing); this noise is 
assumed to be imbiased and white with covariance matrix R 


so that the measurement model may be written 

yzzh(x,p)‘hw, (4) 

In order to adjust tlie internal 4-D representation to the 
process being observed in the real world, prediction error 
feedback is used according to the recursive estimation tech- 
niques^^ derived from the Kalman filter^ and its exten- 
sions^ ^ The new best estimate for the relative object state 
X is obtained by adding to each predicted state component 
weighted elements depending on the measured prediction 
error; tlie weights arc determined by the so-called Kalman 
gain matrix K (or its equivalents in the sequential scheme 
discussed below) taking the noise characteristics Q and R 
(confidence in both the underlying process model and the 
measurements) into account: 

$ = (5) 

Note tliat no special provision is made for perspective 
inversion; the least squares core of the algorithmic proce- 
dure for computing the elements of K takes care of perspec- 
tive inversion hidden in tlie prediction step and the Jacobian 
matrix C. Gain computation is not detailed here for brevity ; 
because of occlusions, varying aspect conditions and pertur- 
bations in the imaging process, the length of the measure- 
ment vector will change steadily. In order to accomodate this 
easily, the measurement update is made sequentially for 
each component; this also saves computing time and has 
been a standard feature of the 4-D approach from the begin- 
nin^^ 

From tills possibility it can be seen immediately that an 
update of all state components can be made from just one 
single measurement input; this may look like magic for 
people grounded in direct perspective inversion. Though 
this capability is true - substituting knowledge about the real 
process for missing data -, too few measurements over an 
extended period of time will lead to drifts in some state 
components poorly observable from this measurement, or 
due to model errors as compared to the actual process 
observed. However, in spite of this fact the value of this 
property based on the 4-D model can liardly be overesti- 
mated for bridging short periods with insufficient measure- 
ments for what cause soever. Even periods witliout any 
measurements may be bridged by pure predictions (vanish- 
ing second term on right hand side of eq. (5)). 

An important point resulting from prediction is the ca- 
pability to efficiently direct image processing by confining 
attention to smaller subareas of the image, and by providing 
information on wliich algoritluns may be most economical 
in the next image (e.g. mask orientation for edge clement 
extraction). 

Central role of the Jacobian matrix: Wiinsche^^ de- 
veloped methods for exploiting the entries into the Jacobian 
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matrix for other purposes beyond simple recursive state 
estimatioa When computing power is limited, there usually 
are many more features available for extraction and state 
estimation than can be handled by the system available or 
from a price / performance point of view; in this situation it 
is very essential to be able to automatically select tlie most 
rewarding set of features yielding best estimation results in 
limited time. The entries into the Jacobian matrix, balanced 
for poorly compatible units for tlie state variables (like 
positions in meters and angles in degrees), allow to concen- 
trate computing power for image evaluation in tliose areas 
of tlie image and onto those features by which best accuracy 
is achieved. 

Small values of elements in tlie balanced Jacobian indi- 
cate that the corresponding state component hardly effects 
the corresponding feature position; if an entire column has 
small values this means that the corresponding state compo- 
nent hardly affects any measurement quantity; therefore, it 
can not be expected tliat this state component can be re- 
covered accurately from the values measured. Likewise, if 
an entire row has small entries this feature is almost constant 
and independent of state changes; this feature is not well 
suited for updating the state vector and may well be elimi- 
nated from further processing. 

For example, for a rectangular box looked at along a 
center line almost parallel to four edges (so tliat the image 
is a rectangle) it is not possible to recover the exact viewing 
angle because of the cosine-effect involved. If the size of tlie 
box (width, height and length) has to be determined in 
addition to the relative state, the dimension in viewing 
direction, of course, cannot be recovered; if the box is turned 
by 90° this size component is well determinable now wliile 
another one , now pointing in viewing direction, can no more 
be iterated. In the general case, the actual aspect conditions 
determine which state components or shape parameters can 
be observed and which ones have to be frozen until the 
corresponding entries in the Jacobian matrix become large 
enough again. This determines the perception strategy and 
-management. 

In summary it can be stated that the efficiency in image 
sequence understanding by the 4-D approach is due to the 
frequent bottom-up and top-down traversal of the repre- 
sentation hierarchy; this is done each cycle of rather short 
duration so that the correspondence problem is not too hard. 
The linear differential models allow to tap well proven 
system theory. However, this only works for the continuous 
tracking phase. 


Initialisation versus tracking 

It is almost impossible to say something meaningful in 
general to the initialisation problem since it depends very 
much on the task domain and on the knowledge available to 


the sy'stcm. For this reason, the tracking phase for specific 
task domains has been developed first; as expected, once 
this capability has been available to a certain extend, it 
turned out to be relatively easy to jump from feature aggre- 
gations discovered in an initial search pliase (with veiy low 
cycle times) to an object hypothesis for which Uie tracking 
capabilities are givem^. If stable tracking can be established 
and die prediction errors converge below certain tliresholds 
it is claimed that a motion process involving an object of the 
class instantiated has been discovered; if this is not the case 
die hypodicsis is rejected and a new one has to be tried until 
die set available is exhausted. Surprisingly many cases can 
be handled successfully by this procedure^^’^ ; however, 
many unresolved problems remain, especially when occlu- 
sions are involved. 

Again, for certain task domains like vehicle recognition 
on highways, many of these problem situations have been 
resolved by creating die capability to recognize partially 
occluded objects, even diose appearing from full occlusion 
like in lane changes of one of two vehicles in front^^; since 
only partial information is accessible in these cases, model 
based recognition involving knowledge about normal sizes 
of objects, about part hierarchies and about likely motion 
states is essential. 

One often hears die call for more systematic bottom-up 
hypothesis generation in the initialisation phase; this, of 
course, would be nice to have. However, considering the 
discrepancy in computing power required for diis type of 
initialisation in a somewhat complex realistic scene, and the 
one needed lateron for the tracking pliase it is conjectured 
that - even in the long run - the approach of jumping to 
(maybe several parallel) hypotheses relatively early and 
then do a critical evaluation over time exploiting 4-D models 
may be a sensible way to go. More experience in several task 
domains is necessary in order to answer this question in a 
solid way. 

The 4-D solution for complex tasks 

The basic principles discussed above for the single sen- 
sor, case cany over to multiple objects and multi-sensor 
systems^^’^'^. A general scheme for these types of complex 
real-time systems is given in fig. 3. 

As in the single object, single sensor case there is just 
one unifying mental representation for recognizing die situ- 
ation and for controlling action; however, for each object 
observed diere is a specific process (presendy implemented 
on a dedicated group of processors) with access to a corre- 
sponding knowledge base for this object class. In this 
knowledge base generic background knowledge is stored; 
besides physical properties with respect to motion (the 
elements of die dynamical model) specific properties with 
respect to the different measurement processes are stored. 
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Fig. 3: Multi-sensor, multi object recognition scheme by prediction error feedback 


The figure shows provision for four types of sensors, two 
imaging ones on a pan and tilt viewing direction platfomi, 
and two other sets, one for inertial data about the egomotion 
(lowest group in center) and one for other conventional 
sensors like odometer, tachometer, steering angle, throttle 
setting, brake pressure and range sensors or the like. 

The inertial sensors may directly feed dynamical models 
for separating egomotion from visually observed relative 
motion with respect to another object. Otherwise, vision 
does allow this separation only when besides this object also 
a third, static one can be observed simultaneously; accuracy 
and robustness may be much poorer without inertial data. 

For measurement signal interpretation, a Jacobian ma- 
trix has to be computed for each pair of object and sensor, 
in figure 3 tliis is indicated by the vertical arrows to the 
diagonal block in the lower right center. 

Contraiy to the conventional sensors, image sequence 
evaluation may yield measurement data on several objects 
in parallel; in fact, one of the difficulties in machine vision 
is the assignment problem of features extracted to objects in 
the scene. Grouping image sequence processing according 
to objects observed, as in the 4-D approach, therefore, 
requires a perception management subsystem shown also in 
tlie lower right center, Tliis is an area of actual research and 
development; good solutions to this problem will be crucial 
for high performance machine perception systems. 

In order to further decouple real-time fast and safe 
control from slower activities for situation recognition, two 
different time scales for image sequence interpretation have 
been introduced in our system. Besides tlie fast tracks for 
estimation of relative position, one each for each object of 


relevance, based on rather few features per object and work- 
ing at 25 Hz in the new TlP-sy stem, there is one subsystem, 
attention controlled from the higher levels, which runs at 
about 5 Hz and is capable of recognizing objects on a more 
detailed level; specialists for recognition of typical road 
veliicles^^ (trucks, vans, passenger cars) and of moving 
humans^^ (walking, running, bicycling and arm waving) are 
under development. 


Perception management 

Vision and inertial measurements do have nice com- 
plementary properties: Vision incurs long delay times be- 
tween data collection and object state estimation in general 
(100 to 300 ms typically, both in biological and in technical 
systems). In addition, because of tlie signal integration in the* 
basic sensing element, motion blur will occur in the image 
during faster rotation, yielding rate signals derived from 
image sequences unreliable; on the other hand, inexpensive 
inertial sensors may yield rather accurate rate signals with 
almost no time delay. These inertial sensors become expens- 
ive when provision has to be made for low drift rates (long 
term stability). Combining inertial sensors with vision 
allows to build a flexible system with good overall proper- 
ties: viewing direction may be stabilized by controlling the 
platform with the negative angular mte from an inertial 
sensor, thereby improving the conditions for image evalu- 
atioa Visual fixation of the viewing direction onto a well 
visible set of stationary features allows to solve for the 
inertial drift problem. 
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In bifocal vision, two cameras are mounted fixed 
relative to each other on a gaze control platform; witli 
one wide angle lens for a large viewing angle covered 
and one tele-lens for high resolution capability within 
a subarea of the wide angle image, there is a need for 
viewing direction control in a saccadic mode in addi- 
tion to the smooth pursuit mode for object tracking: 

If the tele camera tracks an object, and in the wide 
angle image a new object of liigher interest is dis- 
covered, viewing direction should be changed as fast 
as possible in order to center the new object in the 
tele-image. With the viewing direction control plat- 
form developed^^ a 20° saccade can be performed in 

about 150 ms; during this fast turn of four evaluation Selectable fast reflex like feedback control determination 

® with triggered feed forward components; situation 

cycles (440 ms = 160 ms) no useful information can dependent control mode 

be derived from the blurred images. Therefore, within 
these periods the internal representations have to be 
updated according to the 4-D model exclusively; only 
after slowdown below a certain angular rate in the 
vicinity of the coordinates aimed at, image feature 
extraction will start again with new predicted posi- 
tions and search ranges. Via a status bit this informa- 
tion is broadcast to all ’object processes’ together 
with the actual viewing direction. 

Another important point in perception manage- 
ment on the object level is handling of occlusions. It 
is not yet clear, how much of this should be done on 
the ’object process’ level and how much on the situ- 
ation level; both have to deal with the problem in 
parallel. On the situation level (upper right comer in 
fig.3) the semantics in the task context have to be 
taken into account; on the lower object level the 
problem is to decide in each evaluation cycle which Hierarchical scheme for adaptable fast control determination 

features belong to which object, how this attribution 

affects relative state estimation and what is the most Fig. 4: Situation dependent, kn^ledge based 

likely separation line between the two objects, intelligent control^ 

In addition, during this process the question has 

to be answered which state variables and which shape par- 4 shows the general scheme adopted in the 4-D approach to 
ameters are presently observable; the interpretation models intelligent autonomous systems based on state feedback for 
have to be adjusted correspondingly. Especially the relative fast reactive counteraction to perturbations and event-trig- 
viewing angle (in azimuth) of vehicles aliead changes ob- gered feed-forward control adaptation to a new situation, 

servability frequently in typical highway traffic situations'^; both managed by knowledge based situation assessment and 

the problem of veliicle length estimation has already been behavior selection; by puq)ose, this is not called behavior 
referred to above. planning since the generic, well proven behavioral capa- 

bilities are available (or may be learned in more advanced 
versions) and are just invoked with the right set of par- 
Intelligcnt control ameters. The actual control laws, of course, are specific to 

the task at hand. 

The own body carrying the camera is now always repre- 

sentedasanobjectoftherealworld(numberlinfig.3, lower The rest of the paper gives a survey on the different 
right). Since Newtonian motion is of second order in each application areas to which the 4-D approach has been suc- 

degree of freedom the state vector also contains all velocity cessfiilly applied; it is grouped according to the task fields: 

components in 3-D space; this allows state vector feedback Rendez-vous and docking, surface vehicle guidance, and 

for achieving some goal function in an optimal way. Fipre dynamical grasping in 3-D space. 
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Rendcz~vous and docking 

Most of the methodical developments of the 4-D ap- 
proach liave been performed by Wuensche^^ on the single 
sensor, single object problem of controlling planar motion 
in three degrees of freedom of a tabletop aircusliion vehicle 
with reaction jet control relative to another 3-D body (satel- 
lite model plant in the laboratory). The real-time system lias 
been controlled by a VAX 750 combined with a custom- 
made 8-bit image sequence processing system B Wl^. The 
system performed a self-calibration of the horizontal mount- 
ing direction of the camera relative to a docking rod for final 
mechanical fixture. All six state variables relative to the 
docking partner of known polyhedral shape, and the vertical 
mounting direction of the camera have been estimated con- 
tiniuously by tracking four comer features. Which ones of 
the usually eight visible features should be selected for 
tracking in order to achieve optimal accuracy of relative 
position estimated, has been decided by the system itself 
exploiting the entries into the Jacobian matrix. While the 
usage of modified Kalman filters has found very wide 
acceptance in the vision community in the meantime, the 
more detailed exT)loitation of the information in the Jacobian 
matrix does not seem to have been appreciated correspond- 
ingly. 

A somewhat different type of rendez-vous with one 
relative state component (horizontal speed) appreciably dif- 
ferent from zero (about 200 km/h in the actual example 
flown) is the landing approach of an aircraft ; in this spatial 
maneuver the number of state variables is doubled to twelve 
and there are four analog control variables instead of the 
three discrete ones witli the satellite. Large perturbations 
may occur due to wind gusts; therefore, inertial sensors (rate 
and position gyros as well as accelerometers) have been 
important for robust recognition of the relative position to 
the mnway over the last 1 to 2 Km during approach. For 
initialisation, signals from a Differential Global Positioning 
System DGPS have been used^®’^^ Thus, the guidance 
system may be classified as multi-sensor, single object 
(besides the own body, of course). 

The system has been developed over a period of more 
than a decade^^, starting from simple all-software-simula- 
tions. Lateron, in moving base simulations (three rotations) 
with computer generated imagery and the real sensor and 
computer hardware in the real-time loop, fully automatic, 
on-board autonomous landing approaches (including side- 
winds and gusts) until touch down have been demonstrated; 
this type of flight simulator for machine vision autopilots 
seems to be the only one in operation up to now. 

Real flight experiments have been performed in 1991 
with a twin turbo-prop aircraft DO 128 of the University of 


Braunschweig; the human pilot was in control, but the 
perception system estimated tlie complete state vector at a 
rate of 16 Hz. 

In tlie meantime, the hardware base for the system has 
been changed to transputers and a bifocal camera arange- 
ment; new flight experiments are planned for spring 94. 

Surface vehicle guidance 

Contrary to the developments in the US-DARPA ALV 
program, without having knowledge about these activities 
at all, we started from a behavioral approach based on the 
4-D method for continuous vision processes. No higher 
level Al-components have been involved on our side in- 
itially; the system was capable of recognizing local road 
environments and of reacting in such a way that certain 
predescribed behavioral parameters like speed, offset from 
a line or maximum lateral accelerations were observed. The 
capability of performing (elements of, or full) missions 
developed over time on this base. 


Local reactions for motion control 

Road runner: Taking the guideline model for the con- 
stmetion of high-speed roads as the essential knowledge 
component for recognizing a road recursively while driving 
on it, a substantial gain in efficiency for image sequence 
processing has been realized'"'^. Mysliwetz"'"’*''* expended 
this to robust road recognition, including the general case of 
hilly terrain, by applying the ’Gestalt’ -idea -known from 
psychology- to road shape recognition from a large number 
of (approximate) tangent elements. In the classification 
scheme discussed, this work till the end of tlie 80ies be- 
longed to the single sensor, single object group. For more 
demanding real world applications, especially high speed 
driving, it turned out that a combination of cameras with 
bo til small and large focal length, termed ’bifocal vision’, is 
desirable; this combination has been in use for years, but 
with separate signal evaluation for different objects and 
purposes. Recently, the signals from botli cameras have 
been used for recognizing the one object road in a joint 
evaluation^^ (’two sensors, one object’- case; upper central 
part infig.3). 

Intelligent cmise control: Adding to tliis lane keeping 
capability the one for obstacle recognition, relative state 
estimation and relative state control^’^"^, within the same 
framework of event-triggered feedback and feed-forward 
behavior selection as shown in fig.4a, remarkable perfor- 
mance sufficient for driving on ’Autobahnen’ in normal 
traffic situations has been achieved^"^. 

The reactive feedback scheme (lower level in fig. 4a) is used 
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for realizing: 

lane- and speed-keeping with speed adjusted to 
horizontal curvature such that a preset lateral ac- 
celeration level is not exceeded; 
convoy driving behind another vehicle with dis- 
tance depending on speed driven (2 seconds rule); 
this includes 'stop & go* driving in traffic jam as a 
special case. 

The event-triggered feed-forward scheme (upper level in 
fig.4a) provides tlie capabilities for: 

transition from the unconstrained cruise, lane keep- 
ing mode to the convoy driving mode (including 
stop in front of an obstacle); 
lane changing to left and right. (At present, the 
human driver has to check whether the intended lane 
is free; in response to an inquiry he triggers the lane 
change by hitting a key on the board.) 

More than 2 000 km have been travelled autonomously in 
normal Autobahn-traffic since September 1992 with the two 
veliicles VaMoRs of UniBwM and VITA of our industial 
partner Daimler-Benz. 

These results on the two lower levels of fig.4b are 
achieved essentially with differential representations and 
local considerations; a different situation occurs when glo- 
bal points of view in a task context come into play. The 
upper level in 4a (the medium one in 4b) forms the transition 
from strict, local reactive control to global mission perfor- 
mance. 


Global performance (mission control) 

The result of an application of a stereotype feed-forward 
control time history is a state change with roughly predict- 
able differences between initial and final conditions; this 
so-called ’maneuver element’ may be labeled by a symbol 
and stands for the finite state transition as an integral repre- 
sentation of this control sequence (e.g. ’lane change’ : Same 
driving conditions except for a lateral offset of one lane 
width). 

Maneuver elements may be generic by specifying some 
parameters which modify the control sequence; for 
example, lane change may be specified as smooth or rough 
by fixing the maximum lateral acceleration limit in an 
otherwise structurally fixed control sequence; this is equi- 
valent to specifying the maneuver time allowed. On the 
Autobahn, all navigation is done by proper lane changes and 
lane following; this makes mission performance rather 
simple. Besides the well structured environment, this was 
one of the reasons for choosing Autobahn-driving as tlie first 
field of application for practical machine vision at the end 
of the 70ies. 

When the capability of reading traffic and navigation 


signs on tlie Ajatobahn is added to the existing system, this 
goal will be achieved. 

Landmark navigation on the factory floor is much more 
demanding, though in case of failures the damage possible 
is much less because of the low speeds driven. Taking doors, 
well visible features on workbenches and other rectangular 
markers with special height-to-width ratios as landmarks, 
the suitability of the 4-D approach for real-time visual 
landmark navigation has been demonstrated in 1991 with 
moderate computing performance available^^ with an AGV 
in a laboratory environment and in a factory hall. 

Driving on road nets with an automobil in an autono- 
mous mode performing an abstractly defined mission is the 
most demanding task demonstrated, though yet far from 
robust real-life applicability. The system shown schemati- 
cally in fig.5 is in the final stage of development for this 
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• Bifocal camera pair: tele, wide angle with active viewing direction 
control 

• Object-related, intelligently controlled feature extraction 

• Recognition of moving objects exploiting spatio-temporal 
models (4D) 

• Situation analysis in task context (Al-Aspects) 


Fig. 5: Transputer based system for autonomous 
mission performance 

purpose: the lower line of blocks is formed by object-spe- 
cific processor groups each consisting of 16-bit processors 
for feature extraction and 32-bit processors with floating 
point units for recursive state estimation; usually, these 
groups work on separate areas of the image for which they 
make their own predictions. Occlusions have to be dealt with 
in cooperation between such processor groups. 

All object related data are exchanged via a dynamic data 
base (DDB) which always contains the most recent esti- 
mates of object states and parameters. The higher levels of 
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the system shown above the DDB incorporate situation- and 
control- specific knowledge for fmding the best behavioral 
mode in the actual situation. Both fast reacting state feed- 
back control laws and event triggered feedforward control 
time histories (as discussed for lane changing but also for 
turning off onto a cross-road) may be applied. A landmark 
navigation component has been added on top of this for fully 
autonomous mission realisation^^. 

The system, in a different mode of operation, may also 
be used for monitoring and warning when the human driver 
is in control of the vehicle . 


Dynamical grasping in 3-D space 

Quite a different application of the 4-D approach has 
been demonstrated in May of 1993 during the Spacclab 
mission D2 with the Space-Shuttle Columbia, One of the 
experiments on board was the RObot Teclmology EXp eri- 
ment ROTEX of DLR, Germany; in this set of tasks under 
tlie direction of G.Hirzinger one of the tasks was to grasp an 
object freely floating in space in a confined woikcell for 
safety reasons. The relative position between the object and 
the six-degree-of-freedom robot arm was to be determined 
from a camera in the hand of the robot; one of the difficulties 
was that the computers for visual interpretation and control 
had to be on the ground. Due to the routing via three 
geostationary satellites and quite a few groundstation com- 
puters the lumped delay time from measurerucnt taking until 
the control signal derived from these data again arrived on 
board the Spacelab was around six seconds! 

This-time delay has been compensated by exploiting the 
dynamical models for the object to be caught and for the 
robot arm. On May 2nd, 1993, this maneuver has been 
performed by ’remotely -brained machine vision’ automat- 
ically after initialisation of visual tracking by a hum^m 
operator^^. 

Conclusions 

The development of the 4-D approach to dynamic ma- 
chine vision continues to be successful. Spatio-temporal 
models oriented towards physical objects together with the 
laws of perspective projection in a fonvard-modit (and as 
approximate linear relationships between the states or par- 
ameters of the physical objects and the features by which 
these objects may be visually recognized) are tire core 
elements of the method. The spatio-temporal models as 
invariants for object recognition also serve for integrating 
multi-sensory’ measurement data. 

By prediction error feedback an internal symbolic 4-D 
representation of processes involving these objects is being 
maintained allowing situation assessment and longer term 


predictions. 

For specific tasks, behavioral capabilities can easily be 
realised by state feedback or feed-forward control. The 
internal feedback loops from state prediction to measure- 
ment activities in the image plane make interactions be- 
tween the liigher and lower processing levels veiy efficient. 
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ABSTRACT 

In this paper a fusion technique is presented for the 
3-D modeling of the free navigation space. The fusion 
technique could be used by an autonomous robot to 
model and acquire the navigation space and for the 
extraction of the 3-D map from the environment. The 
fusion technique is based on the appropriate synthesis of 
two different sensory data generated by a vision camera 
and a laser scanner. 


KEYWORDS: Fusion, Image Segmentation, Laser 
Scanning, 3-D images 
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1. INTRODUCTION 

Humans and animals are born with multiple 
senses that allow them to develop a very clear picture of 
our world. Thus, in the design of autonomous robots, or 
robust vision systems, the cooperation of the multiple 
sensors is needed to enhance the system's chances of 
success in a complex environment. Of particular 
importance is how the robot formulates a 3-D image of 
the environment. We intend on using the fusion of a 
color segmented picture and a laser range finder to 
develop a working model of the space. 

The idea of fusing two sensory signals to 
formulate a 3-D image is by far not new. Extensive 
work has been done in this field of machine vision. We 
encourage the reader to further enhance their 
background by looking at (I] and [2]. 

Several very interesting approaches have been 
done in the field of fusing intensity and range data. Of 
particular interest is the work done by J.K. Aggarwal et. 
al. [3]. 

In particular in [3], a method of fusion between 
range data and intensity is presented. One major goal of 
this approach was to minimize the extensive amount of 
time required in sensing the range data. Using potential 

Copyright © 1993 American Institute of Aeronautics and 
Astronautics, Inc. All rights reserved. 


points of interest from the intensity data as directors for 
the range sensor, the range sensor senses those points of 
interest and then the range data and intensity data are 
combined to form the graph. 

Another method is discussed in [4]. This method 
uses the most dominant sensor at one time (range or 
intensity) for seed segmentation. This segmented area 
then again uses the data from the range and intensity 
for region merging. The authors used this method to 
minimize the lighting difficulties and color variance 
(intensity image segmentation), and to minimize the 
difficulties range image segmentation has with sloped 
surfaces. 

Other related work is available in {5], [6], and [7]. 

It is interesting to be mentioned that the laser 
scanner has very high resolution (similar to a digital 
image) very close to the scene surface. This means that 
in longer distances, such as 1 m or longer than that, the 
resolution changes drastically. For instance, in a distance 
of 1 m, the size of the laser spot has diameter of 1 cm 
approximately. Thus, how the image 3-D model will be 
developed under these conditions. 

We propose a methodology, which hopefully will 
correct some of these problems, such as long distance 3- 
D image modeling and color difficulties, by using the 
intensity data as the director for the laser. More 
specifically, we use a fuzzy image segmentation 
technique [9] for the sufficient separation of the image 
regions with different color. In addition, we generate an 
image with a low resolution equivalent to the laser one, 
thus the fusion of these sensory data to be possible and 
the final 3-D image to contain less "noise" on the 
scanned surfaces. 

This paper is organized into five sections. Section 2 
discusses the laser range and some of the existing 
problems. Section 3 presents briefly the fuzzy image 
segmentation technique for the separation of the regions. 
Section 4 describes the fusion methodology of these two 
different sensory data. The last section summarizes the 
overall presentation. 
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2. LASER RANGE DATA ACQUISITION 


Ideally we would like to scan the picture area with 
a laser scanner. This would provide us with a range 
map that could be "fused " with our intensity segmented 
map. However the inhibitive high cost of laser scanners 
is not within our university's budget. 

That leaves us with two options; we can attempt to build 
our own laser scanner or we can purchase a point by 
point laser range detector that can be manipulated in a 
xy plane. Due to the time constraints of this paper we 
will assume that we have the funds to purchase a point 
by point detector. It is also worth noting that it would 
probably take us a considerable amount of time to build 
the complex circuitry required for a laser scanner. Our 
objective is to develop a new fusion technique. 

One company that has a variety of laser distance 
meters is RIEGL[8]. Features included in these 
detectors are RS233 interface (as long as measure times 
are > 50 ms), immunity to electro-magnetic and acoustic 
interference, high speed and accuracy, and a starting 
beam width of s icm with a divergence of 3.2 mrad. 
Since our robot is designed primarily for enclosed 
hazardous environments the laser sensor range does not 
need to be exceptionally long. The meter fulfilling our 
needs the best is the LD90-210-CX/C6. It's sensing 
range is from 1 meter to 15 meters. In addition [8] also 
has a biaxial rotary mount (BD 90). The mount enables 
us to selectively pick the intensity segmented areas. 

Theoretically a beam width the size of a pixel 
would be ideal. This would permit a one to one fusion 
with the pixels form the intensity segmentation. 
However technology has not yet reached this minute 
beam width. The authors of this paper are aware of 
very small diameter beam widths but these lasers are 
not yet available. 

After the separation of the intensity scan into 
specific colors the area of each color segment is 
determined and put in xy coordinates. Once an area is 
defined the bidirectional mount is manipulated so that 
the laser range meter can determine the range of the 
area in question. When the scanning of the area is 
complete another color area is scanned until the entire 
intensity frame has been scanned. The intensity map 
and the range map are then fused to form our first 
attempt at the 3-D image. 

One of the critical features of the laser scanner is 
the non-uniform scanning of the scene. More specifically, 
since the mechanical XY movement mechanism used by 
the laser scanner is imperfect, there is the possibility 
that the laser beam to overlap two consecutive scanned 
point, or to be far away from each other, as shown in 
figure 1. A solution to this particular problem is the 
scanning of the same scene area twice, and the 
averaging of two distance values by reducing the 
possible error. 



Figure 1; The imperfect scanning 


3. FUZZY IMAGE SEGMENTATION 

Segmentation and edge detection in color images 
are not deeply investigated in the literature. Some 
methods separate the three color components and work 
on each independently. Three different histograms are 
drawn and after segmentation or edge detection, the 
results are combined to form segmented or edge 
detected color images. The results using these techniques 
are false because of three reasons: one, segmentation or 
edge detection based on color alone is not complete. The 
relative position of the pixels and their color in their 
neighborhood are important as welL Two, in real images 
there is no sharp valley in histograms. Three, once one 
pixel color is mapped to three locations on three 
histograms, pixel color information is scattered. And 
once this is done for thousands of pixels, all available 
information is scattered and mixed. After thresholding 
doing the reverse and putting the three components 
back together is not an easy job. This technique is 
usually used for images having a limited number of 
segments or clusters. 

The technique described in this section is 
applicable to both color and gray level images having an 
unlimited number of clusters. A cluster is defined as a 
collection of touching pixels which have almost the same 
color or the change in color is gradual. Since an 
unlimited number of clusters could be detected from an 
image, displaying each cluster in a different image 
would not be practical. What we have done is displaying 
all clusters in one image. Each cluster is shown using a 
color that reflects the main color of the cluster. 

Before segmentation can begin a rough edge 
detection must be performed. The used edge detection 
technique is not described here but it is sensitive enough 
to detect all sorts of edges (crisp, fuzzy, thin, and week). 
A histogram table is also generated and sorted according 
to the number of pixels. The first three entries of the 
table for a particular image are shown below: 


21 27 23 892 
21 26 23 640 
8 12 28 591 

The numbers represent red, green, blue, and the 
number of pixels in the image having that particular 
color vector, respectively. Segmentation is performed in 
the following steps: 1- find big clusters, 2- expand 
clusters, 3- find medium size clusters, 4- expand clusters, 
5-find small clusters, 6- fill in the blanks. 


where R, G, and B are the three color components. 
Variance membership function is shown below: 




0 if Variance < 

1 if Variance^^ > 


Big clusters are defined as those having a 
minimum of 150 pixel members. To find big clusters 
instead of looking for valleys in the histogram we look 
for peaks. The first peak is the first entry in the 
histogram table. Therefore, the seed color is known and 
the seed location must be found. The image is scanned 
for that particular color vector and once found it is 
test-grown and pixels marked until a detected edge is 
reached and the size of the test-grown cluster is noted. 

1 his is done for all unmarked pixels having the seed 
color. The seed location that uses the most of the seed 
color pixels and is big is chosen as the seed location and 
its color as the cluster color. The seed is then grown. 
While the seed is growing and pixels are added to the 
cluster, each pixel is marked and its entry in the 
histogram table decrement. Meaning, that particular 
pixel is not available any more and does not contribute 
to the histogram table. When the seed is all grown, the 
histogram table is resorted and the first entry, now, is 
the second peak of the histogram. This growing process 
is repeated until there are no more big clusters left. 

Since the edge detection algorithm used is very 
sensitive to variance, the detected edges are thick 
especially in the case of fuzzy edges. Therefore, big 
clusters must be expanded using fuzzy logic to fill out 
that gap before smaller clusters are found. Big cluster 
expansion is performed in two steps: In the first step 
each cluster is expanded (surrounding pixels added to 
the cluster) based on two rules: 

1. The absolute variance (difference between the 
pixel's color and the seed/cluster's color) is low. and 

2. The relative variance (difierence between the next and 
the previous pixel's color) is low. This is used to 
include gradual changing shade areas as well. 

In the second step each cluster is expanded based on the 
following rule: 

For each pixel the absolute variance with the 
surrounding clusters and the distance (in number of 
pixels) with that cluster is evaluated and their product 
recorded (degree of farness). If the lowest product value 
(which is for the closest cluster in terms of color and 
distance) is lower than a threshold, the pixel is added to 
the cluster. 

The following equation is used to calculate 
variance between pixel 1 and 2. 


Variance^^ = -/?,)'+ (G, - 



Averaging is used in anding the two fuzzy membership 
functions. 

Medium size clusters are found the same way big 
ones were but this time they are grown based on the 
first step of expansion rules (minimum size 30 pixels 
total). They are then expanded based on the second step 
of expansion rules. 

To find small clusters, for each undecided pixel 
in the image the degree of farness with the nearby 
clusters is evaluated. 

If the lowest value is higher than a threshold, use 
the pixel as a seed and test-grow it based on the first 
step of expansion rule. If the test-grown cluster has at 
least 6 pixel members and has a big contrast with the 
closest cluster (its existence is of importance), grow it, 
otherwise leave it undecided. To fill in the blank for all 
undecided pixels the closest cluster (in distance and 
color) is found and the pixel is added to that cluster. 
Figure 2 shows a colored image and its segmentation. 
Figure 3 shows a grey image followed by its segmented 
one. 


4. THE FUSION METHODOLOGY 
4.1. Methodology 

The fusion methodology presented here is based 
on the synthesis of different sensory data generated by 
a vision system and a laser scanner. In particular, the 
camera receives an image from the environment. The 
digital form of the image P(ij), KiJ<n, where "n" 
represents the size of the image matrix, is stored in the 
main memory. The digital image represents a 2-D array 
of gray level values or color values (gv(ij)) for the image 
pixels. At the time that the image is grabbed by the 
camera, the laser scanner "scans" the same area of the 
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Figure 3: a) A grey level image 
b) Its segmentation 


A colored image 
Its segmentation 
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environment by producing an 2-D array D(k,m). The 
values d(k,m) of the D array represent the distances of 
the laser scanner from the surface of the current 
navigation space at the particular area from which the 
image was grabbed. The D array is also stored in the 
memory. From this point, the fusion technique takes 
place. More specifically, a fuzzy segmentation technique 
is applied on the image values to define the regions with 
"color" similarities. An edge detection technique is also 
applied on the same image to define and separate the 
boudaries of the image regions |9]. Since the resolution 
of the P image-matrix is higher to the D distance-matrix, 
the resolution ratio (R(F)= [R(P)/R(D)1) is calculated. 
This R(F) ratio will guide the image pyramid generation 
technique for the selection of the pyramidal level of the 
image (PL) with the closest resolution to the D matrix, 
figure 4. 



At this point, the fusion matrix F is generated by 
combining the coordinates of these two matrices PL and 
D respectively. More specifically, the elements of the F 
matrix have four values: the x* and y* coordinates, the 
d distance of these x'y' coordinates, and the gv grey 
level value which cooresponds to x*y*; f(x*,y*,d,gv). The 
3-D image model (F3) for the original image will be 
generated by the F matrix with the expansion of each 
f(x*y’,d,gv) element by R(F) pixels. More specifically, 
f(x',y',d,gv) — > S{p} ^ 

where S represents the set of 4 neighbor pixels, so 
that R(F)% 4^ . Thus, 4^ neighbor pixels of the 3-D 
image model will share the same distance d; 
p(i+wj+z,d,gv), where w,zc{l,2,...2^ }. 

At this point, the colored image regions separated by the 
segmentation method wiU be used for "smoothing" the 
possible unevenness of the 3-D model More specifically, 
there will be cases where the pixels, which belong to 
different colored regions, share the same distance d. In 
this case, the distance d(Rc) which represents the 
majority of the pixels in the region Rc, with color "c", 
will be used to redefine the distance of those pixels, 
which share a distance different than d(Rc). 


Figure 5: A grey level image 



Figure 6: The reduced image using pyramidal 
procedure . 


4.2. Hhistrativc Example 

In this subsection we present graphically the fusion 
methodology by using a simple grey level image, figure 

5, the reduced image using the pyramidal process, figure 

6, a generated distance matrix with equivalent 
resolution, figure 7, the fused 3-D image at the reduced 
size, figure 8, and the 3-D model of the original image, 
figure 9. 



Figure 4: The pyramidal reduction of 
an image 



Figure 7: The distance matrix (in a 3-D form) 
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5. CONCLUSIONS 

In this paper a fusion methodology has been 
presented. This methodology combines two different 
type of sensory data for the generation of a 3-D image 
model. The sensory data used by this method were the 
distances produced by a laser scanner and the pixels 
average intensities (or color) of a 2-D digital image 
grabbed by a vision camera. The problem of the 
different resolutions for these two sensory data was 
solved by the reduction of the image resolution, the 
fusion of the different data and the reconstruction of the 
original image by using a fuzzy image segmentation 
technique. Since the laser scanner was available for this 
experimental work, only feasibility simulated results^ 
were presented. 
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Abstract 

This paper describes current work on a cooperative 
tele-assistance system for semi-autonomous robots. 
This system combines a robot architecture for lim- 
ited autonomous perceptual and motor control with 
a knowledge-based operator assistant which provides 
strategic selection and enhancement of relevant data. 
The design of the system is presented, together with a 
number of exception-handling scenarios that were con- 
structed as a result of experiments with actual sensor 
data collected from two mobile robots. 

Introduction 

Traditional telerobotics research focuses on the sep- 
arate contributions of the human, computer and robot 
in the performance of a particular task. Historically 
a single human at a local system is dedicated to op- 
erating a single remote robot. However, continuous 
supervision may be impractic8tl for applications where 
the communication bandwidth acts as a bottleneck, 
and/or transmission time delays make remote high- 
dexterity control difficult or impossible. Further, the 
operator may not be reliable due to fatigue, while 
increased environmental complexity, use of multiple 
sensing modalities, or the addition of more robots all 
may contribute to the cognitive overload of the oper- 
ator. 

One approach to these problems is to increase 
robotic autonomy through the addition of intelligent 
capabilities. Control schemes, such as shared control 
and supervisory control, reduce both the amount of 
communication between local and remote, and the de- 
mands on the operator by increasing the autonomy of 
the remote. However, there is still a need for human 
problem-solving capabilities, particularly to configure 
the remote for new tasks and to respond to unantic- 
ipated situations. Thus the teleoperations commu- 
nity is becoming increasingly interested in comput- 
erized assistance, both for the effective filtering and 
display of pertinent information or data, and also for 
the decision-making task itself. 


The purpose of this paper is to describe current 
work on a cooperative tele- assistance system which 
combines the autonomous perceptual and motor con- 
trol abilities of the Sensor Fusion Effects (SFX) ar- 
chitecture [4] with the intelligent operator assistance 
provided by the Visual Interaction Assistance (VIA) 
system [5], In this approach, the remote system con- 
sists of a robot with sufficient computerized intelli- 
gence to function autonomously under a particular set 
of conditions, while the local system is a cooperative 
decision-making unit that combines both human and 
machine intelligence. The computerized aspect of each 
system enables communication to take place in a com- 
mon mode, and in a common language. 

The paper presents an overview of the design of 
the system itself, and then discusses a number of ex- 
periments and scenarios using data from two mobile 
robots. The scenarios demonstrate how the local tele- 
VI A system would assist an operator to respond to a 
sensing problem which could not be resolved by the 
exception handling mechanism of the remote robot. 

Background and Related Work 

Coiffet and Gravez [2] suggest that “instead of 
searching for an overall model including complex con- 
cepts as human behavior, it is more profitable to con- 
sider the computer role as performing assistance func- 
tions to humans”. They go on to describe three sit- 
uations in which conceivably computer and human 
can collaborate, and compare the differences between 
telepresence, “a system-oriented assistance centered 
on machine transparency”, and teleassistance, which 
refers to “task-oriented assistance”. They state, fur- 
ther, that: 

Increasing the capabilities, and therefore 
the complexity, of a teleoperation system 
soon results in a cognitive overload for the 
human operator. The design of a cooperative 
system should thus introduce strategic assis- 
tance forms that facilitate on-line symbolic 
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control. Generadly speaking, the basis for 
such assistance is the selection and process- 
ing of relevant data (sensor outputs and ex- 
ecution reports) and the filtering of operator 
commands. This is reflected in a high-level 
structural dialogue embodying task-oriented 
diagnosis and proposing pertinent solutions. 

At the symbolic level, the computer contri- 
bution is deduced from an understanding of 
the work at hand, and is intended to enhance 
the human operator’s decision-making pro- 
cess and to improve on-line human-machine 
communication. 

The work presented in the remainder of this paper is in 
keeping with the philosophy behind these comments, 
and demonstrates how such a cooperative system may 
be designed and implemented. 

Approach 

In remote and unexplored environments where 
unanticipated events are likely to occur, it is impor- 
tant that the robot be able to handle a number of sit- 
uations autonomously and in real-time. In the event 
of an unresolved problem, however, the remote robot 
must have recourse to a local human operator for assis- 
tance. However, in order for the operator to perform 
diagnosis effectively, the data from the robot must be 
appropriately selected and presented. Once the prob- 
lem has been determined, the operator should then be 
able to re-configure the robot so that the autonomous 
processes may once again take over. An important 
consequence of this approach is that it is now conceiv- 
able for the operator to supervise more than one robot 
simultaneously, and furthermore, to request data from 
other robots working with the one in trouble. 

To achieve this goal of cooperative tele- assistance, 
two major software systems have been joined together 
and modified appropriately for this application do- 
main. The first is the Sensor Fusion Effects (SFX) 
architecture [4], which utilizes state-based sensor fu- 
sion to support the motor behavior of an autonomous 
robot. If a state failure occurs, fusion is suspended 
and control is passed to an exception-handling mech- 
anism, which attempts to identify the problem and 
either repair or replace the sensing plan. The sec- 
ond system, called VIA (Visual Interaction Assistant), 
is designed to cooperatively assist human perception 
and problem-solving in a diagnostic visual reasoning 
task. VIA is a blackboard-style system which utilizes 
knowledge-based techniques to focus the user’s atten- 
tion on relevant parts of the image, automatically en- 
hancing the image according to the needs of the user’s 
problem-solving process. It further manages diagnos- 
tic hypotheses, maintaining beliefs according to cur- 
rent evidence, and assists the user to converge oppor- 
tunistically on a solution where possible. 

The advantage of linking the SFX system with the 
VIA paradigm is that under SFX, the robot has al- 
ready attempted a certain amount of trouble-shooting 
itself. Thus information about what has been tried, 
the robot’s own conclusions, and the relevant sensor 
images can all contribute to the decision-making pro- 


cess of the local operator. In order to achieve this, 
the teleSFX system includes an interactive exception 
handling component, which allows the robot to call the 
operator for help in the event that its own exception 
handling capabilities could not resolve the problem. 

The central communication mechanism between 
the remote and local intelligent systems is the black- 
board structure. The advantage of this architecture is 
that it allows asynchronous communication between a 
number of independent knowledge sources. In the gen- 
eral VIA system, the user is considered to be a knowl- 
edge source as well, cooperating with the knowledge- 
based system in the search for a solution (or partial 
solution). In the case of the tele- assistance system 
(tele VI A), the remote robotic system is also incorpo- 
rated as a knowledge source, thus allowing the three 
entities, robot, teleVIA system, and human operator, 
to make contributions to the solution of the problem in 
a cooperative manner. This is especially important in 
cases where the solution set is not initially tractable, 
and more information must be acquired in order to 
quickly constrain the list of diagnostic hypotheses, so 
that a repair plan may be constructed. An overview of 
the entire system is shown in Figure 1, and further de- 
t^lils are provided in the following subsections. In this 
diagram, it can be seen how the interactive configu- 
ration and interactive exception handling components 
of the teleSFX architecture are merged with the in- 
telligent assistance provided by teleVIA, through the 
panels of the blackboard. The emphasis in this paper 
is on the interactive exception handling aspects of this 
design. 

TeleSFX 

In [3], the teleSFX control scheme was introduced, 
emphasizing the intelligent exception handling mech- 
anism at the remote. Unlike configuration, exception 
handling must be done in real-time (for example, a 
robot may be moving when a sensor malfunctions). 
As shown in [1], autonomous exception handling is 
difficult because it involves domedn and hardware spe- 
cific information which may not always be available or 
correct. 

TeleSFX uses a three part strategy for exception 
handling: deteciion, classification, and recovery. The 
first step, detection, determines that a “sensing fail- 
ure” has occurred. Sensing failures are any anoma- 
lous or suspect conditions that have been previously 
defined by the knowledge engineer. Sensor malfunc- 
tions are one type of failure. Most sensor malfunctions 
manifest themselves via explicit hardware errors com- 
municated to the controlling process (e.g., bus errors, 
frame grabber errors) and tend to be straightforward 
to classify and recover from (e.g., reset the system, 
request a retry). Another class of sensing failures is 
due to unanticipated changes in the sensing environ- 
ment which degrade the performance of one or more 
sensors (e.g., the lights are turned off). The third and 
final class of failures stem from errant expectations, 
where the robot is interpreting the observations ac- 
cording to a model. If for some reason the robot has 
selected the wrong model at the wrong time (e.g., for 
mechanical reasons, the robot did not rotate fully to 
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Figure 1: Cooperative Tele- Assistance System Design. 


the intended viewpoint), the sensor observations are 
unlikely to agree. 

Failures in the latter two classes are difficult to 
detect because the sensors are operating “correctly” 
but their data can no longer be interpreted with- 
out accounting for the changed context. Therefore 
teleSFX is sensitive to inconsistencies in the evidence 
contributed by different sensors for a particular task. 
The knowledge engineer defines a set of failure con- 
ditions representing these inconsistencies for the par- 
ticular implementation. Each perceptual process may 
have a different set of thresholds for those failure con- 
ditions, given the unique interactions between sensors. 

The classification step has the remote attempt to 
autonomously identify a sensing failure, and adapt the 
sensing configuration. This involves hypothesis gener- 
ation, testing and response heuristics at the remote 
site, and several experiments have been described in 
[1] which demonstrate this capability. However, the 
success of the classification step depends on the ex- 
pert understanding of the domain and the sensors. 
This domain-dependence means that classification by 
the remote is brittle and will not always be successful. 
Therefore, if the remote system cannot resolve the dif- 
ficulty, teleSFX must post the request for help to the 
blackboard, together with immediately relevant data 
such as current sensor data and a log of the remote’s 
hypothesis analysis. This signals the teleVIA system 
to activate its knowledge sources in order to request 
and present further data, as well as to perform further 


hypothesis analysis. 

Figure 2 shows the details of the control system for 
the remote site. The local operator is involved pri- 
marily in interactive configuration, and general moni- 
toring, until the interactive exception handling is trig- 
gered by the remote system. At that point, tele VI A 
takes over from teleSFX until the repair is communi- 
cated. 

Blackboard 

The Blackboard is where the evolutionary results 
of the problem-solving effort are captured. The orig- 
inal logical partitioning of the blackboard was based 
on components of a cognitive model of visual inter- 
action described in [5], and was designed to facilitate 
transfer of informat ion between human perception and 
problem-solving during a visual reasoning task. In the 
domain of tele- assistance, it is seen that, with one ex- 
ception, the same logical partitions or panels may be 
used. The additional information which is contributed 
by the remote robotic system is accommodated in the 
subpanels as shown in Figure 3. 

Context Panel 

In the general VIA design, this area contains informa- 
tion that is known about the overall problem context. 
In the tele VI A mode, the Context Panel is used to 
monitor the robot’s (or robots’) current activities. It 
is divided conceptually into three subpanels; 
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Figure 2: Overview of TeleSFX. 



Figure 3: Tele- VIA Blackboard. 



1. Interactive Configuration allows the local opera- 
tor to select appropriate sensors, and to commu- 
nicate sensing and backup plans to the robot. 

2. Interactive Exception Handling receives the sig- 
nal for help when autonomous exception handling 
fails. The remote system immediately posts the 
type of fsdlure, currently active sensors, and the 
belief table for those sensors. This tells the lo- 
cal operator what the perceptual status of the 
robot is at the time of failure, and provides ini- 
tial information for teleVIA to begin formulating 
hypotheses, and requesting further information. 

3. Current Context is a panel which is active during 
both interactive configuration and interactive ex- 
ception handling. It contains information about 
the task underway, the known environmental fac- 
tors and conditions, which sensors are active and 
working, and intermittent video images from the 
robot reinforcing the operator’s knowledge of the 
context within which the robot is currently func- 
tioning. 

In the initial design of teleVIA, the overall Context 
Panel is used simply as an informational tool for mon- 
itoring the robot. 

Hypothesis Panel 

This panel contains the current hypotheses that con- 
stitute the partial (or complete) solutions that are 
evolving as a result of the problem-solving activity. 
It is divided into two sub panels: 

1. Robot Hypotheses contains the hypotheses gener- 
ated by the teleSFX system at the remote site, 
and reflects the diagnostic and problem-solving 
activities carried out autonomously by the excep- 
tion handling mechanism of the robot. 

2. TeleVIA Hypotheses contains the hypotheses gen- 
erated by the knowledge sources of teleVIA, based 
on the information posted by the remote system 
in combination with more extensive knowledge re- 
trieved from the teleVIA knowledge base. 

Attention Panel 

This panel is the locus of the visual focus-of- attention 
mechanism. It is also partitioned into two parts: 

1. Attention Directives are issued by the teleVIA 
system in order to assist the local operator’s per- 
ception of relevant data. To accomplish this, tele- 
VI A may request particular images to be trans- 
mitted by the robot. In this way, delays due 
to transmission of unnecessary and/or extraneous 
data may be avoided. Furthermore, since the im- 
ages are selected by teleVIA’s knowledge sources 
according to the current problem, they are more 
likely to be pertinent and useful. The directives 
issued to the operator are then aimed at guiding 
him/her to look at particular aspects of the data 
provided by the remote system. 


2. The second area of the Attention Panel consists 
of one or more images, obtained from the robot 
by the teleVIA system. Depending on the sen- 
sory modality of the displayed images and/or data 
(e.g., video vs. infra-red vs. ultrasonics), tele- 
VIA will also automatically execute appropriate 
image enhancements designed to facilitate the op- 
erator’s perception of the feature(s) in question. 
In this manner, the superior perceptual capabil- 
ities of the human operator can be exploited in 
order to diagnose the problem more quickly. 

TeleVIA 

In Figure 4 are shown the components of the co- 
operative system which assists the human supervisory 
activities at the local site. TeleVIA contains four main 
control modules: Hypothesis Manager, Strategy Selec- 
tor, Attention Director and User Interface, together 
with a knowledge base which serves as the repository 
of long-term information in the system. The Hypoth- 
esis Manager impacts the blackboard through the ac- 
tivities of hypothesis-related knowledge sources. The 
Strategy Selector is used to pass control from the Hy- 
pothesis Manager to the Attention Director, since the 
way in which attention is focused may depend on the 
strategy used for reducing the list of active hypothe- 
ses. The Attention Director is concerned with focusing 
attention by presenting and enhancing images as well 
as suggestions to the operator of what to look at next. 
The User Interface is the component through which 
the human operator communicates with the teleVIA 
system. 


Hypothesis Manager 

The purpose of the Hypothesis Manager is to percolate 
information through the levels of the blackboard via 
the activities of the knowledge sources. Each knowl- 
edge source has a set of preconditions that must be 
satisfied by information at a particular level of the 
blackboard. It then performs a transformation of the 
information at one or more levels. Some examples of 
knowledge sources which are activated by the type of 
sensor involved in the failure are illustrated in the fol- 
lowing tables. 

K^S 1 

Precondition: 

The sensor involved is video. 

Action(s): 

Request latest image from robot. 

Post image. 

Retrieve and post video hypotheses. 


K-S 2 

Precondition: 

The sensor involved is infra-red. 

Action(s): 

Request latest image from robot. 

Post image. 

Retrieve and post infra-red hypotheses. 
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Figure 4: Overview of TeleVIA. 


K-S 3 

Precondition: 

The sensor involved is infra-red and image is 
posted. 

Action(s): 

Retrieve model from current context, 
if model = available then 

invoke model- false- color enhancement 
else invoke generic-false- color enhancement. 
Post image. 


Strategy Selector 

This module is invoked by the Hypothesis Manager 
when a knowledge source needs further information 
before proceeding. It examines the current blackboard 
configuration in order to determine an appropriate 
strategy for the next step in the problem-solving pro- 
cess. A High Level Plan is then generated to carry out 
the selected strategy, and is passed to the Attention 
Director for refinement and execution. 

Attention Director 

The Attention Director module takes the High Level 
Plan produced by the Strategy Selector, and con- 
structs an Attention Plan that contains detailed in- 
structions for focusing attention. The steps of the 
Attention Plan are based on the particular type of 
evidence that is needed to fulfill the mandate of the 
Strategy Selector. These steps are expanded with im- 
age enhancement procedures where appropriate, and 
are executed. Control is then passed to the operator 
for feedback. In this way, the system presents infor- 
mation, makes suggestions, and enhances the image(s) 
in such a way as to influence the direction of the op- 
erator’s problem-solving. 


User Interface 

The User Interface is divided into two parts: the 
Logical User View^ which controls how much of the 
blackboard is visible to the user, and the Presentation 
Manager, which controls the form of the interface as 
it is presented to the user. The Logical User View 
component of the user interface allows the system to 
be adapted for various purposes without compromis- 
ing its basic problem-solving approach. For example, 
when the operator is simply monitoring the robot, 
and performing interactive configuration, the panels 
involved in exception handling should be hidden from 
view. There may also be a certain amount of data 
posted to the blackboard, which is utilized by teleVIA 
in its hypothesis management, but which should not 
necessarily be visible to the operator. On the other 
hand, the Presentation Manager provides the actual 
human-machine interface of the system through a dis- 
played representation of the Logical User View. This 
may take a number of forms including menus, icons, 
graphics, and/or direct manipulation windows, and 
may even extend to audio as well as visual mecha- 
nisms. 

Experiments 

The experiments described here use data for scene 
recognition which was collected from two sources. Sce- 
narios 1, 2, 3 and 5 are based on sensor observa- 
tions collected from the Denning DRV mobile robot, 
George, at the Georgia Institute of Technology. Sce- 
nario 4 is based on sensor data from Clementine, the 
Colorado School of Mines’ Denning MRV-3 mobile 
robot. It should be noted that some of the data has 
been used in previous experiments. The exception 
handling activities at the remote in the experiments 
described in this paper differ from previous work [3] 
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because they make use of the more rigorous system 
developed in [1]. 

Five different types of sensors (an Inframetrics true 
infrared camera, a black and white video camera, a Hi8 
color camcorder, a UV camera and ultrasonics) pro- 
vided observations from George. Clementine supplied 
data sets from three sensors (a black and white video 
camera, a color camcorder, and ultrasonics). Both 
robots simulated security guards, where the task was 
to determine whether a student desk area of a clut- 
tered room had changed since the last visit. In the 
following scenarios the focus is on the activities of the 
teleVIA system in response to the request for help 
from the remote system. 

Scenario 1 

In the first experiment, the robot collected data for 
the desk scene while facing a different part of the room. 
This resulted in a “high conflict” type of state failure 
during fusion. The robot then generated a hypothesis 
of sensor malfunction, and attempted to run diagnos- 
tics on the two conflicting sensors. These diagnostics, 
however, showed that both sensors were working cor- 
rectly, and at this point, the robot could not proceed 
further, and signalled for assistance. 

At this point, the robot posts a request to the in- 
teractive exception handling panel of the blackboard, 
indicating the type of feulure it has encountered, and 
including the beliefs which led to this failed fusion. In 
the initial version of tele VI A, the images leading to 
this conflict are also transmitted. The first knowledge 
source of tele VIA is activated with the precondition 
that a video camera is involved in the failure. This 
causes the video image to be displayed first, together 
with a list of preliminary hypotheses of what could be 
the problem. Examples of hypotheses include: wrong 
input, sensor malfunction, sensor occlusion, sensor 
hardware error (missing data, self-diagnostic error), 
multiple sensor errors and electromagnetic interfer- 
ence. Since the purpose of the system is to provide as- 
sistance as quickly as possible, an assumption is made 
that, if applicable, images which are most easily per- 
ceived by humans (e.g., video images versus thermal 
images) are given priority. Thus, if by looking at the 
video image, the operator can immediately ascertain 
what the problem is and resolve it, then this most ef- 
fective solution to the problem should be supported. 
Once the operator selects the probable diagnosis, a list 
of repair possibilities may be posted to the interactive 
configuration panel for implementation. 

Scenario 2 

In the second experiment, the lens of the camera 
was covered in opaque plastic to simulate a sensor 
malfunction due to external factors such as dirt on 
the lens, for example. In this case, the fusion process 
resulted in a “below minimum” type of failure, and, 
as a result, the exception handling mechanism gener- 
ated a first hypothesis of “inadequate sensing plan”. 
A backup plan was then implemented, and sensor data 
was reaquired accordingly. The new plan added a color 
camera to the sensing suite, and subsequently a fusion 
failure of “high conflict” was encountered between the 
black and white camera and the color camera. As in 



Figure 5: Example Image for Scenario 2. 


the previous experiment, teleSFX then generated the 
hypothesis of sensor malfunction, performed diagnos- 
tics which denied this hypothesis, and then called for 
assistance. 

In this case, both of the failures are posted to the 
blackboard, together with the beliefs generated for 
each attempt. Once again, the primary troubled sen- 
sor is the black and white camera, and the image 
generated is shown in Figure 5. In this case, how- 
ever, since the second attempt introduced the con- 
flicting image from the color camera, both the black 
and white, and the color video images are displayed by 
teleVIA for the operator to examine first. In this case 
as well, the operator should be able to determine the 
problem fairly quickly by simply comparing the black 
and white video image with that of the color camera. 

Scenario 3 

In the third experiment, the lights were turned 
off during data collection to simulate an unforeseen 
change in environment. In this case the exception 
handling mechanism of the robot arrived at a correct 
conclusion of “environmental change” by testing the 
visible light information. However, for this type of 
problem, operator assistance is still needed for recov- 
ery, and therefore a message is posted to the interac- 
tive configuration portion of the blackboard request- 
ing intervention. The beliefs leading to the original 
state failure, together with the hypotheses generated 
and tested by the robot, are posted to the blackboard, 
while images and data from the relevant sensors (black 
and white camera, and UV sensor) are also displayed. 
This enables the operator to determine what type of 
environmental change may have occurred. 

In each of these experiments, the primary sensor 
involved in the problem was the black and white cam- 
era. Since these experiments were originally designed 
to test the autonomous exception handling capabilities 
of the teleSFX system, the results, when extended to 
the teleVIA component are somewhat artificial. How- 
ever, they serve the purpose to establish the type of 
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Figure 6: Polar Plot of Ultrasonics Data. 


information which must be communicated between the 
remote and the locsd systems in even such elementary 
scenarios. This allows us to determine the types of 
knowledge sources which may be activated, the dif- 
ferent types of hypotheses which may be needed, and 
how to present this information effectively using the 
blackboard mechanism. 

Further experiments are underway which empha- 
size sensor data which is not as easily perceived by 
the human operator, and which may require enhance- 
ment before conclusions may be drawn. In these cases, 
teleVIA knowledge sources are activated according to 
the type of sensor(s) involved in the state failure. This 
is then combined with knowledge of the current con- 
text to select appropriate enhancements, and display 
the information to the local operator. The following 
scenarios were constructed using images acquired by 
the robot for a drill press scene. 

Scenario 4 

In this example, it is assumed that the ultrasonics 
are contributing primarily to the fusion fedlure. In this 
experiment, one out of the 24 ultrasonics transducers 
mounted in a ring began to report widely fluctuating 
readings. A sensing failure of “highly uncertain” evi- 
dence was reported, but the responsible sensor could 
not be isolated, thereby necessitating aid from the op- 
erator. The raw ultrasonic readings that come from a 
Denning mobile robot are just numbers, which repre- 
sent measurements in feet. However, when this data 
is represented as a polar plot as in Figure 6, it is much 
easier to notice if one or more of the sensors is giving 
erroneous readings. This is further reinforced if the 
numerical data are examined in the light of knowledge 
about the current context, for example, that a room 
(or mine shaft) is thought to have certain dimensions. 
A further enhancement of the data which can aid the 
local operator is an occupancy grid, which presents a 
bird’s eye view of what the robot has sensed so far. 
The robot builds up this grid or map as it processes 
ultrasonics data. With both of these types of displays. 



Figure 7: Equal Band FeJse Color Enhancement. 


the operator is more likely to diagnose the failure of 
an ultrasonic transducer or board, or to detect an er- 
roneous reading. 

Scenario 5 

When the sensor in question during exception han- 
dling is the infra-red camera, enhancements are once 
again needed to assist the operator’s perception of the 
information in the image. In this case, the untouched 
true infra-red image is typically gray scale, and there is 
often not a great deal of discernable contrast in the im- 
age. It is common practise to add false color to such an 
image to show the heat distribution. However, certain 
choices of false color maps still do not enhance the im- 
age, and may obscure the details even further. In the 
drill-press example, dividing the grayscale into 6 equal 
bands of color leads to a primarily yellow image, due 
to the extreme heat of the drill press. A grey scde ver- 
sion of this image is shown in Figure 7. However, if the 
selection of false color map is knowledge-based, utiliz- 
ing model-specific information about the drill press, 
for example, then a more appropriately enhanced im- 
age is produced, making it easier for the operator to 
see the heat profile represented as blue, green, red, yel- 
low, and white bands. This is illustrated in the grey 
scale rendition in Figure 8. 

Future Work 

Current work is concentrating on constructing ex- 
periments in real-time where an operator at Clark At- 
lanta will interact with the remote robot (Clementine) 
at Colorado School of Mines. An important issue 
which has not been addressed in this work so far is 
that of learning. The robot will typically be work- 
ing in hazardous and/or remote environments about 
which little may be known, and therefore it is difficult 
to anticipate the types of problems which may arise. 
Not only would it be desirable to increase the auton- 
omy of the individual robot wherever possible, but the 
knowledge gained from solving these problems could 
be disseminated to other robots in the field. Further- 
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Figure 8: Model-Specific False Color Enhancement. 


more, if the teleVIA system could “remember” cer- 
tain interactions, these could immediately be retrieved 
from memory, rather than having to generate the same 
session over again. The technique of case-based rea- 
soning is a natureJ candidate for this type of learning. 
Each interactive exception handling session may be 
captured as a case, which would be indexed on features 
such as particular configurations of sensors and failure 
types. Such a case could also include relevant images, 
or at least image types and enhancements used, so 
that teleVIA would simply use a case retrieval mecha- 
nism rather than a potentially complicated reasoning 
strategy. Certain aspects of the exception handling 
and recovery procedures might also then be commu- 
nicated to the robot itself, to extend its autonomous 
capabilities, especially for recurrent problems. 

Summary and Conclusions 

This paper presents a new approach in semi- 
autonomous mobile robots, which reduces the level of 
human supervision and provides intelligent assistance 
for problem solving. The approach partitions problem 
solving responsibilities between the remote and the lo- 
cal machines. The remote system monitors its sensing 
for anomalies, called sensing failures, using teleSFX. 
If a failure occurs, it attempts to classify the source 
of the problem using a generate and test methodol- 
ogy. If it is successful in identifying the source, it 
then attempts to autonomously recover (e.g., go to 
back up sensors, change parameters). Otherwise, if 
the source cannot be classified, or if no recovery strat- 
egy is available, the local machine must provide the 
exception handling. Exception handling at the local 
is done by the operator, with the help of teleVIA. Tele- 
VIA uses a common blackboard to cooperatively assist 
the operator by posting what has been done by the 
remote, displaying and enhancing sensor data needed 
in ascertaining the problem, and managing diagnostic 
hypotheses and beliefs. Experimental scenarios using 
data collected from mobile robots illustrates the oper- 


ation of the system. 

The advantages of this type of tele- assistance fall 
into three categories. First, it is practical. It reduces 
the need for direct human supervision and communi- 
cations by having the remote monitor itself for failures. 
Second, it increases efficiency by freeing the opera- 
tor to supervise multiple remotes, reducing cognitive 
overload by controlling the presentation and enhance- 
ment of sensory data from the remote, and aiding the 
problem-solving process through hypothesis manage- 
ment and expert guidance. Three, the approach sup- 
ports the incremental addition of artificial intelligence 
as more progress is made in learning and planning. 
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Abstract 

Autonomous mobile robots need to integrate many 
different skills in order to perform complex tasks. 
In particular, they need to explore, sense, map and 
navigate in unknown or partially known environ- 
ments. This paper describes a robot system that 
is designed to perform a find-and-deliver task in an 
office-building-like environment. The robot’s initial 
orientation and location within the environment are 
not known, but the robot does have an a-priori map 
of the environment. We describe a sensor-based map 
representation that the robot uses while exploring its 
environment. We also describe how the robot deter- 
mines its initial position and orientation within the 
environment, how it explores the environment for a 
visually-tagged object, how it recognizes the object 
and how it delivers the object. The robot also up- 
dates its map to reflect changes in the environment. 
While the entire robot system has not yet been in- 
tegrated, each subsystem described in this paper has 
been implemented and tested. 

Introduction 

Autonomous mobile robots need to explore, sense, 
map, navigate and perform tasks in the environments 
in which they find themselves. Often these five func- 
tions are studied separately, with little or no atten- 
tion given to how they are all integrated to produce a 
completely autonomous mobile robot. In this paper 
we concentrate not on completely describing any sin- 
gle aspect of robot exploration, sensing, mapping or 
navigation, but instead on how many different skills 
can be integrated into an autonomous robot that per- 
forms a sophisticated task. Unfortunately, time con- 
straints prevented a complete integration of all of the 
described skills on the mobile robot. All of them, how- 
ever, were tested individually and their integration is 
planned. 

*Now at The MITRE Corporation, 1120 NASA Road 
1, Houston TX 77058, e-mail: korten@aio.jsc.nasa.gov 
Copyright (c) American Institute of Aeronautics and 
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Task description 

The task our robot is designed to perform is to find 
a single, visually tagged object somewhere in a large, 
office-like environment and to “deliver” the object to 
a designated room. The robot is given a crude map 
shortly before being asked to perform the task. How- 
ever, the map does not show obstacles that may block 
hallways or doors, nor does the map show all of the 
doors in the environment. The robot does not know 
its starting position or orientation with respect to the 
map. The delivery object is in one of the rooms and is 
a coffee pot marked with a black-and-white ‘X’, The 
robot need not actually pick-up the coffee pot, only 
approach it. Some, but not all, of the doors are tagged 
with a visually distinct bar-code; bar-coded doors are 
noted on the map and the delivery room will be one 
of them. The robot has 30 minutes to complete the 
task, which was one of three tasks that comprised the 
AAAI ’93 Robot Competition and Exhibition held in 
Washington DC on July 11-16, 1993. 

The task is challenging to mobile robots because it 
requires the integration of many mobile robot skills. 
The robot must initially explore the environment and 
determine its position and orientation with respect 
to the a-priori map. The robot must then plan an 
exploration strategy that will allow it to examine each 
room for the coffee pot. This strategy must be flexible 
in the face of unexpected obstacles. Finally, the robot 
must use visual sensing to detect the coffee pot, plan 
a path from the object to the delivery room and then 
follow that path. 

Robot description 

Our robot is a Cybermotion K2A called CARMEL 
(Computer-Aided Robotics for Maintenance, Emer- 
gency and Life Support) (see Figure 1). It has a 
ring of 24 sonar sensors and a rotating Bi^W cam- 
era. Three computers are on-board CARMEL, one 
computer each for the motors and sonar sensors and 
a 486-PC for high-level processing. The 486-PC has 
a framegrabber and performs all image processing. 
CARMEL has a basic obstacle avoidance competence 
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Figure 1: The mobile robot CARMEL. 


provided by an algorithm called VFH [2, 3, 4]. VFH 
constructs a certainty grid of sonar hits and uses it 
to continually compute a new direction that will take 
the robot towards its target while avoiding obstacles. 

Overview 

We first present the robot’s representation of its 
environment. This is the representation that is en- 
tered into the robot from an a priori map. The robot 
must then register itself (i.e., determine its orienta- 
tion) with respect to the environment; we give a basic 
registration algorithm. Next the robot must localize 
itself with respect to the a priori map; two different 
localization algorithms are presented. Once the robot 
is registered and localized, it can begin exploring the 
environment and looking for the coffee pot. We de- 
scribe our vision algorithm to detect the coffee pot 
and also describe how we update the a priori map to 
reflect changes in the environment. Finally, the robot 
must navigate from the room that contains the coffee 
pot to the delivery room. This sequence is shown in 
the flow chart in Figure 2. 

Representing the environment 

The map in our representation is a graph of nodes. 
Each node represents a region of the environment that 
has a common sonar signature. Each link between 
nodes represents a bidirectional connection between 
the two regions. The first issue when creating such 
a representation is to decide on an appropriate sonar 
signature that will distinguish between different re- 
gions. 

Detecting region boundaries 

There have been many approaches to using sonar 
sensors to define distinctive places in an environment, 
including [10, 1, 6, 7, 9, 8]. In our approach, a region 
in the environment is characterized by having a com- 
mon sonar signature throughout its extent, where the 
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Figure 4: The VFH obstacle avoidance algorithm. 


signature is the pattern of free or blocked space to the 
front, back and sides of the robot. Thus, there are 16 
unique sonar signatures in a rectilinear environment 
(see Figure 3 for a complete listing of the 16 sonar sig- 
natures). Our approach is unique in that it is directly 
tied to an obstacle avoidance algorithm — the Vector 
Field Histogram (VFH) [4]. 

The VFH algorithm first creates a histogram grid, 
which is a certainty grid representation of the objects 
surrounding the robot as detected using the robot’s 
sonar sensors. VFH then takes a local window of the 
certainty grid and converts it into a polar represen- 
tation called the polar histogram. A certainty grid 
and its corresponding polar histogram are shown in 
Figure 4. The polar histogram shows the obstacles in 
each direction around the robot. To avoid obstacles, 
VFH simply chooses the free direction of travel that 
is nearest to the desired direction of travel. This same 
polar representation is used to produce the sonar sig- 
nature. 

A simple example will best show how the polar his- 
togram is used to detect a region boundary. The robot 
is started down the hallway (the direction of the hall- 
way is determined by an algorithm described in Sec- 
tion 3) and VFH automatically aligns the robot and 
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Figure 2: Flowchart for accomplishing the find and deliver task. 
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While our boundary detection algorithm works fine 
in hallway environments, it has not been extensively 
tested in rooms. We rely instead on the dead reckon- 
ing capabilities of our robot to move into and out of 
rooms. Extending our approach to rooms as well as 
hallways is a topic of future research. 
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Figure 5: Detecting region boundaries using VFH. 

positions it in the middle of the hallway. When the 
robot is positioned in the middle of a hallway the po- 
lar histogram has two “mountains” for the two walls 
of the hallway (Figure 5(top)). The presence of a 
“mountain” means that the robot is blocked to that 
side. In this example, the sonar signature is: (front 
= open, back = open, right = closed, left = closed). 
As the robot moves down the hallway and approaches 
the doorway, the “mountain” on that side of the robot 
will disappear (Figure 5(bottom)). So the sonar sig- 
nature is now: (front = open, back = open, right = 
open, left = close). By “camping out” at the polar 
histogram segments corresponding to the front, back, 
left and right of the robot, changes in the sonar sig- 
nature can be immediately detected. 

In tests on the repeatability of this algorithm, 
CARMEL was asked to repeatedly stop at the same 
region boundary in the basement of our laboratory. 
Over ten consecutive runs, the largest difference in 
position along the hallway’s axis between any two 
runs was 520mm and the largest difference in posi- 
tion perpendicular to the hallway axis along any two 
runs was 290mm. During these runs, obstacle avoid- 
ance was performed and the robot was running at a 
speed approaching 400 mm/sec. 


Map representation 

Each region of the environment, which corresponds 
to a sonar signature, is represented by a node. A node 
contains the extent of the region (i.e., its length and 
width), a global (x,y) position of the center of the 
region and connections to neighboring regions. Fig- 
ure 6 shows an example configuration of the arena 
where the robot will be working. As shown in Fig- 
ure 7, the whole area is divided into regions based on 
the sonar signature. The regions are further distin- 
guished by either being a hallway region or a room 
region. Each hall section has one node at the center 
of the hall (nodes with “H” prefix). Every exit of the 
room also has one node close enough to the entrance 
(nodes with “R” prefix). Each room section has some 
extra virtual nodes (nodes with “V” prefix) for each 
side of the walls of the room. These virtual nodes 
serve two purpose. First, they are used to figure out 
the boundary of the room. Since each node has its 
(x, y) coordinate, we need at lezist two room-nodes to 
calculate the boundary of a rectangular shape room. 
Second, they are used for map modifications which 
will be explained later in this paper. 

Rooms can have up to four exits, one each to the 
north, south, east and west. If a room has more than 
one exit on each side it will be split into several vir- 
tual rooms. Large open space, such as lobbies, are 
also classified as rooms and may have to be split into 
several virtual rooms. For example, rooms 7,8 and 9 
in Figure 7 are all virtual rooms contained within a 
single open area. 

Registration 

In order for our representation scheme to work, 
CARMEL must be able to determine the main axes 
of the corridors, so that it can start searching for re- 
gion boundaries to its left, right, front and back. We 
call this registration. Currently, CARMEL can only 
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Figure 6: Arena Configuration 
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register itself in a hallway; if CARMEL starts in a 
room it must wall-follow until it enters a hallway. To 
register in a hallway, CARMEL starts to travel in any 
free direction. As it travels, the VFH obstacle avoid- 
ance algorithm will automatically align CARMEL be- 
tween the two walls of the hallway. While moving, 
CARMEL saves its (x,y) positions along the way and 
fits a line to them. The orientation of this line is used 
to determine the axis of the hallway. CARMEL can 
reregister during the task to correct dead reckoning 
errors. 

During initial registration, when the robot has no 
information as to its orientation in the environment, 
CARMEL also stores and averages the direction of 
free space. This should always fall along the axis of 
the hall. However, obstacles in the hallway, door- 
ways, and intersecting corridors cause CARMEL to 
drift from the middle of the hall. Therefore line fit- 
ting, which is a simple chi-square fit, is used. This 
occurs after CARMEL has traveled a minimum dis- 
tance and the rate of change in the average free-space 
direction falls below a threshold. If CARMEL be- 
comes trapped before this time, it turns around and 
starts the process again assuming that it has reached 
a blockade in the passage or the end of a hall. If 
the orientation of the fit line is too far from the aver- 
age free-space direction, it is assumed that CARMEL 
has not been traversing a hall or has “fallen” into 
a room or an intersecting hallway. In either case, all 
data are disregarded and the entire process, including 


wall-following if necessary, is repeated. 

For reregistration during the task, the previous ori- 
entation can be used to judge the accuracy of the cal- 
culated orientation. When there is a large difference 
between the previous and new orientations, either the 
old one can be maintained or the process can be re- 
peated. Maintaining the old orientation repeatedly 
is dangerous because CARMEL’s orientation can be- 
come very inaccurate over a period of time. 

We evaluated the registration algorithm in two sit- 
uations. The first situation was in a corridor with 
no obstacles or openings into rooms or intersecting 
hallways. These experiments set out to confirm that 
the algorithm will correctly identify the hall axis re- 
gardless of CARMEL’s initial orientation. In the sec- 
ond set of experiments, CARMEL was placed in a 
more complex area which included an obstacle and 
an opening into a room. The intention of these runs 
was to determine the robustness the algorithm, as it 
currently stands, in a more realistic situation. 

Twenty-seven runs were carried out in the obstacle- 
free hallway. CARMEL’s initial orientation with re- 
spect to the hall axis varied from 10 to 170 degrees in 
20 degree steps. CARMEL’s initial orientation was 
determined by eye and so is inaccurate by up to a 
degree or two. At each initial orientation, three runs 
were made. CARMEL determined the actual hall- 
way axis to within six degrees in all but one run. In 
this case, the calculated hall axis was off by nine de- 
grees. No run required more than approximately four 
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meters. The accuracy of the registration and the dis- 
tance covered during a run were acceptable and within 
the limits of the environment that was expected to be 
encountered. 

The second set of runs had two parts. The first part 
was carried out with CARMEL given an initial orien- 
tation of 30 degrees. The second part used an initial 
orientation of 70 degrees. Eight runs were carried out 
with each orientation. In the first part, CARMEL 
determined the hall axis to within five degrees each 
time, except one in which it wandered into the room 
through the opening. The average distance required 
was approximately five meters. In the second part, 
CARMEL entered the room twice, but determined 
the hallway axis to within four degrees in the other 
six runs. The average length of the successful runs 
was about 2.6 meters. 

While these runs were far from exhaustive, they 
do show that this method of registration is useful. 
The most difficult problem is that of wandering into 
a room. This can either be avoided or detected, with 
the first preferable. The difficulty with preventing 
CARMEL from drifting into a room is that there is 
no simple way to distinguish between an opening into 
a room and a narrowing of the corridor due to obsta- 
cles. The former should not be entered while the lat- 
ter should be. Detecting the entry into a room should 
be simpler. The chi-square fit provides a goodness of 
fit measure, namely When CARMEL enters a 
room, its path is generally straight but roughly per- 
pendicular to the hall axis. This should yield a very 
poor value for x^- The use of this value and the re- 
turn of CARMEL to the hallway it left are still being 
investigated. 

Localization 

Once registered, the next critical issue is the deter- 
mination of the correct location and orientation of the 
robot. We call this process localization. CARMEL ac- 
complishes localization through the accumulation of 
information, in the form of local sonar signature fea- 
tures, during its initial movement through the halls 
of the “office” environment, and through observa- 
tion of visual tags identifying doors. We would like 
CARMEL to localize itself as quickly as possible, how- 
ever, so that in the absence of door markers we try to 
use the sonar signature features. In both approaches 
CARMEL is given a map representing the environ- 
ment in which it will be placed. However, the map 
can be in error in that doorways may exist where they 
are not so indicated on the map, and doorways may 
be blocked where they are indicated on the map. The 
localization schemes must therefore deal with these 
problems. 

We have implemented two approaches, one based 
upon heuristics and confidence factors, the other upon 
probabilistic reeisoning using a belief network. Our 
localization methods only work in hallways, so that 
if CARMEL’s initial location was within a room we 
would first have to find an exit using a wall follow- 
ing behavior. In this section we describe each of the 
approaches and show them in operation. Although 


both were implemented, neither have actually been 
fully integrated into the office exploration system. 

Rule Based Localization 

One method of localization that has shown to be 
successful is a rule based system. Before CARMEL 
makes a move during rule based localization, it com- 
putes scores over all of its possible starting locations 
in the a-priori map. 

Creating a Score Distribution 

Since direction is ambiguous to CARMEL at first, 
we run our scoring algorithm four times, rotating 
CARMEL’s map to a new cardinal orientation each 
time. So for n possible starting locations, there are 
4n total scores computed. 

The basic scoring algorithm is a modified depth- 
first recursion, which runs as follows: 

The first feature node seen by CARMEL is com- 
pared with the start node in this orientation. The 
comparison scores points depending on how many 
sonar signature features (i.e., walls or openings on the 
four sides of the robot) match and on the measured 
extent of a region. However, points scored for extent 
matching are fewer because we have determined that 
distance data tend to be more erroneous than the de- 
tected sonar signature features. 

Each node adjacent to the current one on 
CARMEL’s constructed map is checked to see if it has 
been examined yet in this orientation. If the neigh- 
boring node has not yet been examined, then it is 
compared with the node corresponding to it on the 
a-priori map. The algorithm continues this recur- 
sively for all of the paths the robot can follow from 
the start node. The score for each recursion is added 
to the total score for that start node in that particular 
orientation. 

While the algorithm recurses, it also tries to answer 
this question: 

If CARMEL started in this start node with this 

orientation, where would CARMEL be now? 

If the algorithm can determine this, it makes note of 
the fact. We refer to a current location inference of 
this type as a location resolution. There is no guar- 
antee that a start node-orientation configuration will 
produce a location resolution. 

In the event that a path to an adjacent node exists 
in CARMEL’s map but a wall exists on the a-priori 
map, the routine attempts to figure out possible lo- 
cations across the wall that might correspond to the 
node CARMEL saw. If a possible match is found, 
then the routine continues from there; otherwise, no 
points are scored for that area on CARMEL ’s map. 

After all of the location-direction combinations are 
examined, the algorithm normalizes the raw scores 
by computing the mean and standard deviation of 
the score set. Then each original score is replaced 
by the number of standard deviations the score was 
above the mean. The resulting scores are now less 
dependent on the number of nodes seen by CARMEL. 
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Figure 8: Map of the experimental space showing 
a priori regions. 


Move Planning 

CARMEL scores the possible moves it can make 
baised on the location resolutions. CARMEL looks 
at each location resolution that it has and computes 
a shortest path to the nearest door marker for that 
resolution. The first move of this path is considered. 
This first move is either one of the four directions, or 
no movement at all (the special case where CARMEL 
is hypothetically in the vicinity of a door marker). 

For each first move of a particular type, weight is 
added to that corresponding move possibility. The 
weight added depends on the score found for the 
location-orientation pair that the first move was de- 
rived from. If CARMEL saw a door marker at this 
stop, or if any location-orientation score for one of 
the first moves is above a certain threshold, then the 
“don’t move” move choice is given a score of infinity, 
and CARMEL assumes localization is complete. In 
the former case, CARMEL assumes it is now at the 
location on the a-priori map where the door marker 
is. In the latter case, CARMEL assumes it is at the lo- 
cation resolution found for the above-threshold score. 

As a final factor in move choice, we programmed 
CARMEL to select from these possible movement 
choices the highest scoring direction that has the 
shortest path to an unexplored region on CARMEL ’s 
map. This ensures that CARMEL covers unexplored 
space as efficiently as possible while searching out 
door tags. It also guarantees that CARMEL will not 
‘paint itself into a corner’ or oscillate between ad- 
jacent nodes while exploring (two problems that oc- 
curred without this adjustment). 

Experimental Results 

Here are some results of a typical run with the rule 
based algorithm. The map given CARMEL is shown 
in Figure 8, and a door tags are located at nodes 0, 
6, and the north end of 7. 

We placed CARMEL in feature region 2 and 
aligned the robot so that it faced south on the map. 
CARMEL W 21 S told that it was starting somewhere 
along the south hall (nodes 0-6). CARMEL localized 
after the third move without using door markers. 

In Table 1, the Move # column shows the current 
move. The Best Resol v^ column reports the best 


location-direction pair, i.e., CARMEL’s top choice(s) 
for where it may be. The number is the feature node 
label corresponding to the map, and the direction is 
the direction CARMEL thinks it’s facing. For exam- 
ple, 2-south means CARMEL thinks it may be at 
node 2 facing south. 

The ‘Best’ Move(s) column indicates the best 
moves computed by the possible move scoring rou- 
tine. Directions are displayed here as the true direc- 
tion on the map for ease in interpretation. Multiple 
directions indicate a ‘tie’, in which case the final move 
is chosen from them based on exploration preference. 

The Move Choice column is the actual move made 
by CARMEL. It may differ from the previous column 
if CARMEL chose an unexplored area over a high 
score direction. 

It may seem strange at first that the ‘best’ move 
and the move choice are totally uncorrelated after the 
first move. One must remember that the ‘best’ move 
is a result of weighting all possible moves for all resolv- 
able pairs, so it doesn’t necessarily represent the true 
best move that can be made, especially in a symmet- 
ric environment. The moves chosen were carried out 
because CARMEL picked a direction, and preferred 
to explore new area on a ‘next best’ score rather than 
backtrack on a best one (which only may be best by 
a margin). Also, it is important to remember that 
the Best Resolved nodes are not the only nodes used 
in determining direction. All of the possible location 
resolved nodes are considered, weighted only by the 
location-orientation score associated with them. The 
Best Resolved values are therefore only displayed to 
show how quickly the algorithm can localize. 

Belief Network Approach 

In the second localization approach, the depen- 
dence of the sensed features on the world map, the 
robot’s initial orientation, and the direction of travel 
of the robot as it attempts to localize itself, is mod- 
eled using a belief network [5, 11]. As the robot moves 
about and sees new features, the belief network accu- 
mulates a history of the features observed and the 
movements that the robot has made. These obser- 
vations can then be propagated through the network, 
resulting in a probabilistic distribution over the possi- 
ble locations the robot may be in currently. The robot 
considers itself localized when one of the locations 
achieves a level of confidence about a certain thresh- 
old. If CARMEL is not yet localized, it can use this 
distribution to determine the most likely direction in 
which to travel to facilitate better localization. Cur- 
rently, this amounts to moving in the direction most 
likely to take it to a room tag, the most unambiguous 
localization feature detectable by CARMEL. 

Belief network operation 

The belief network that we used is shown in Fig- 
ure 9. This network models the dependencies between 
the robots initial location, its initial orientation, and 
the sonar feature that it “sees” . The modeling is ac- 
complished both through the topology of the network 
as well as the probability tables (both conditional and 
priors). The conditional probability that a certain 
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Move # 

Best Resolved 

‘Best* Move(s) 

Move Choice 

start 

2-south, 4-south 

east, west 

east 

1 

3-south, 5-south 

west 

east 

2 

4-south 

west 

east 

3 

5-south 

no move 

no move 


Ta 


ble 1: Results from an experiment using rule-based localization. 


feature is detected, given a particular location and 
orientation, is based on heuristic calculations of the 
correlation between what should be observed and that 
which is actually observed. The conditional probabil- 
ity that the robot is in a particular location, given a 
previous location and orientation, is a simple boolean 
function based on the map, where the probability is 
1.0 if the locations are adjacent and joined by a path, 
and 0.0 if they are not. 

Upon initialization, an observation of the robot’s 
initial surroundings is placed in the FeatureI node 
of the network and then propagated throughout the 
network. The resulting posterior probability distri- 
bution in the Location 1 and Orientation nodes 
reflect the evidence’s impact upon the likely starting 
location and orientation of CARMEL. The robot can 
use this revised information in its planning to either 
facilitate improved localization or to switch to explo- 
ration of the office environment if the probabilities are 
suitably high enough to justify this. 

In the situation where the resulting probabilities 
are such that CARMEL is still unsure enough of 
where it is to warrant further localization, CARMEL 
plans and executes a move in a direction most likely 
to take it to a door tag in the shortest amount of time 
(i.e., shortest distance). When it detects a change in 
the sonar features around it, it stops and makes an- 
other observation. The new sonar feature, as well as 
the motion the robot made to get to its current loca- 
tion, is fed to the belief network as evidence, propa- 
gated, and the resulting probability distributions ana- 
lyzed. This cycle continues until either CARMEL be- 
comes sure enough of its location based solely upon 
the sonar features so far detected, at which time it 
switches to exploration mode, or CARMEL detects a 
door tag, at which time it knows with certainty where 
it is, and similarly switches to exploration mode. The 
belief network in Figure 10 shows the belief network 
at iteration 3 in the process. As can be seen in this fig- 
ure, the belief network grows at each iteration, adding 
new Location, Motion, and Feature nodes. The 
portion of the network that includes the Move node 
models the dependence of the robot’s new location 
upon the previous location, the original orientation, 
and the move the robot made to get to the new loca- 
tion. 

Experimental Results 

We evaluated the belief network’s ability to localize 
CARMEL in the halls of the University of Michigan’s 
Artificial Intelligence Laboratory. The map for the 
region is shown in Figure 8, with the possible local- 
ization locations indicated by the numbered regions. 
Each of the labeled locations represents a region of 



Figure 9: Initial belief network architecture. 



Figure 10: Belief network architecture showing the 
network at iteration 3 of the process. 
The Feature and Move nodes are in- 
stantiated as evidence, and the Loca- 
tion and Orientation nodes are inferred. 


the map that has the same sonar feature type. Trav- 
elling between regions (locations), then, implies that 
the sonar feature must change at the transition point 
between the regions. 

As an example, suppose CARMEL starts in loca- 
tion 2, the T-intersection at the South end of the 
map, and is initially facing South. The sonar fea- 
ture observed would be that of a single blocked direc- 
tion, that directly in front of it. Passing this evidence 
to the localization network, the resulting probability 
distribution for the current location is shown in Ta- 
ble 2(top), while the posterior distribution of the ORI- 
ENTATION node is shown in Table 2(bottom). These 
state that CARMEL is either in Location 2, 4, 9 or 
11, and is most likely to be facing South. 

If CARMEL then moves West (to it, East on the 
map), it will move until it enters region 3, which has 
a different sonar feature. The new feature, that of an 
East-West hall, and the West move that CARMEL 
made, are both given to the localization network and 
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0.13 

0.04 

0.04 

0.04 

0.04 

0.13 

0.04 

0.13 

0.04 

0.04 

0.04 


0.27 


0.44 


0.15 


0.14 


Table 2: Probability distribution of location (top) and orientation (bottom) at the start location 


propagated. The new probabilities for the current 
location and orientation are show in Table 3, which 
says that CARMEL thinks that it most likely to be 
at Location 10, and is most likely to be facing South. 
Again moving West (to it, East on the map), so that 
CARMEL sees the new feature at Location 4, yields 
distributions shown in Table 4. 

Taking this probability distribution, CARMEL now 
has greater than 90% confidence that it is at Loca- 
tion 4 and was initially facing South. If there was a 
door tag at the entrance to the room at Location 4, it 
could then visually verify that this inference is correct. 
CARMEL can now reorient itself correctly to the map 
and position itself at the transition point between the 
current location (Location region 4) and the previ- 
ous location (Location region 3). Note that since the 
belief network has nodes representing each of the pre- 
vious locations also, it is easy to reason about where 
the robot has already been simply by looking each 
of the nodes and determining the highest probability 
state in each. This facilitates exploration updating 
in that extra time to perform a backtracking search 
through the map with the previously performed mo- 
tions doesn’t have to be done in order to see what has 
already been explored. 

Exploration and Navigation 

Once CARMEL is localized it can begin to look 
for the coffee pot. The first step in this process is 
to plan an exploration path. An exploration path 
is an exhaustive sequence of rooms to visit from the 
robot’s current location. The sequence is determined 
based on the travel distance from the current location. 
CARMEL first selects the closest room (in terms of 
travel distance, not the Cartesian distance), then adds 
another room closest to the selected room, and so on. 
After planning, CARMEL traverses the exploration 
path stopping in each room to scan with its camera 
for the coffee pot. Exploration is terminated when 
the coffee pot is found. The exploration path can 
be modified to accommodate unexpected blockages 
or openings. 

Planning exploration path using closest-node-first 
(hill-climbing) method does not necessarily generate 
the optimal path in terms of total traveled distance* , 
but in practice this method turned out very fast and 
the resultant path was quite reasonable. For example, 
the exploration path from, say, “H18” (in the middle 
of the map in Figure 7 ) would be (R4, R7B, R3, R8A, 
R9A, Rl, R6, R5, R2A). This sequence specifies only 
the room nodes to visit in that order. 


* Finding the optimal exploration path amounts trav- 
eling sales man problem 


Updating the map 

While exploring CARMEL can update its a-priori 
map to reflect blocked hallways and doorways and to 
note additional doorways that were not in the original 
map. For blocked hallways or doorways, the connec- 
tions between the two nodes in the map are cut and 
the sonar characteristic of the node is modified ap- 
propriately. In the case of unexpected openings, new 
nodes are created, assigned the appropriate signature 
and connected to adjacent nodes. This information 
helps CARMEL find the most efficient route to the 
delivery room once the coffee pot has been found or to 
replan its exploration path. For example, if the robot 
sees an unexpected opening to a room from a hall 
node, it finds the nearest node (either virtual or real 
room node) of the room and create a new link to that 
node. Note that each node can have maximum four 
connections (roughly corresponding to North, West, 
South, and East) to other nodes. Figure 11 shows a 
part of the map before the robot sees an unexpected 
opening at west side of the hall “H5”. It first fig- 
ures out which room is next to west of “H5” from the 
boundary information of each room. In this case, it is 
Room 5 and “V5E” is the closest node of that room. 
Now “H5” should be divided into three sections since 
the sections are divided based on the sonar-reading 
changes and new opening will change the sonar sig- 
nature as shown in Figure 12. 

Visual sensing 

As CARMEL enters each room it scans for the cof- 
fee pot. CARMEL’S vision system finds predefined 
markers (a black ‘X’ on a white background in the 
case of the coffee pot) in the environment and deter- 
mines their pose (3D position and orientation) rela- 
tive to the robot. We will describe the algorithm for 
detecting the ‘X’ here. The algorithm for determining 
the pose of the ‘X’ is described in [12]. 

Marker detection 

The marker detection phase is composed of two 
main routines: the connected components routine 

and the marker identification routine. The detection 
phase must be both fast and accurate for the pose 
estimation algorithm to be useful for real-time tasks. 

To maximize speed, we make only one pass through 
the entire image. During the pass, the image is 
thresholded and connected components are found and 
labeled. One pixel components are ignored and not 
labeled. Size thresholding then filters out most of 
the non-marker components. Only one pass is made 
through all possible connected components. 

To identify or reject the remaining markers, we 
use a weighted pattern matching template. An nxn 
template matrix is created for each marker (see Fig- 
ure 13). 
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Figure 13: Weighted pattern template for the ‘X’ 
markers. Positive values indicate ex- 
pected black areas; negative areas are 
expected to be white. Certainty in- 
creases with magnitude. 



Certainty^ = ^ = 0.9583 

. . V f \xrc\ if correct color 
/*(’•> I 0 otherwise 

Figure 14: Sample marker with calculated ‘X’ cer- 
tainty value. “b” indicates a black 
pixel; “w” indicates a white pixel, r 
counts rows; c counts columns. 


Increasing n increases the resolution of the tem- 
plate, but also increases the process time. We found 
n = 7 to be a good compromise. This weighted tem- 
plate indicates which areas are expected to be black 
and which ones white. The weights for our matrix are 
currently determined by trial and error, but we could 
easily replace these with machine generated weights 
if a learning program were implemented. The marker 
template which a component most resembles is se- 
lected as the “guess” for that component. The pro- 
gram generates a certainty measure with each guess 
(see Figure 14) and uses this measure to accept or re- 
ject the guess. Each marker can have one or more 
templates. The additional templates may be used 
to improve marker recognition from views other than 
straight on. 

We also use additional heuristic information in 
identifying the markers. Some heuristics were not 
learned or incorporated until after the program had 
been tested. For example, diagonal lines often scored 
high enough certainty values to be considered ‘X’s. 
Once we realized this, adding a specific test to ver- 
ify that each possible ‘X’ is not a diagonal line solved 
this problem. To avoid slowing down the program too 


much, specific heuristic tests were kept to a minimum. 

Navigation 

Once the coffee pot is found, CARMEL uses the 
vision algorithm’s estimation of the pot’s relative lo- 
cation to approach it. Since CARMEL doesn’t have 
a manipulator, it is assumed that once CARMEL has 
approached the coffee pot it has “grabbed” it and can 
then deliver it to the delivery room. CARMEL plans 
the shortest path to the delivery room using a stan- 
dard shortest-path algorithm. CARMEL then follows 
the path by moving from region to region and detect- 
ing region boundaries with its sonar sensors. There 
are numerous error recovery routines that can cope 
with changes in the environment and sensor errors. 

Conclusion 

Unfortunately, time constraints leading up to the 
competition prevented the complete integration of all 
of the described skills. In particular, the robot did 
not perform registration or localization at the compe- 
tition. Instead the robot was told its orientation and 
position. During the actual competition, the robot 
explored several rooms before becoming hopelessly 
lost, at which time the run was terminated. The most 
difficult problem encountered was tuning the sonar- 
based region-finding algorithm to the particular en- 
vironment. While the algorithm had worked fine in 
our testing environment (the basement of our labora- 
tory), different characteristics of the competition en- 
vironment caused many false detections (i.e., defining 
the start of a new region when there wasn’t one) and a 
few missed detections (i.e., not detecting a new region 
when there was one). Since the robot’s localization 
depended on matching the regions it found with the 
a priori map, it became lost very quickly. 

Our experience demonstrates an important lesson 
in mobile robotics — if the low-level sensing of the 
world is not working correctly, then high-level rea- 
soning or map making will be unsuccessful, no mat- 
ter how elegant their implementations. Our experi- 
ence also underscores the fact that routines that are 
demonstrated to work in one environment will not 
necessarily work in another environment, even if that 
environment is quite similar. In addition, our expe- 
rience was not unique — no robot at the competition 
(out of a dozen entries) successfully completed the 
task. Obviously, there remains much work to be done 
in mobile robot exploration and navigation of indoor 
environments. 
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Abstract 

Manager's associate systems enable users 
to indirectly manage semi-autonomous 
agents to support collaborative, mixed- 
initiative human-computer problem solving. 
MAX is a software framework for building 
manager's associate systems. It provides an 
architectural model, a domain-independent 
software structure, tools, and reusable 
components for building domain-specific 
elements of systems used by managers to 
develop, execute, and analyze task plans for 
agents. This paper presents an overview of 
MAX and describes its first application: a 
ground vehicle manager's associate system 
for the management of robotic vehicles 
exploring a simulated planetary surface. 

Introduction 

The WIMP user interface (Windows - Icons - 
Menus - Pointing Device) was introduced by 
Xerox's Alto and Star and popularized by the 
Apple Macintosh almost 1 5 years ago. 

These products were followed by several 
generations of new personal computer 
products, each improving hardware and 
software functionality but not extending the 
standard WIMP direct manipulation interface 
concepts. Recently, a number of papers 
have appeared suggesting that a new 
generation of user interfaces is coming that 
will feature semi-autonomous agents that 
perform intelligent functions for the user.’-'*- 
6 . 7 , 9 . 10. 12 i^ay fQ|. example, 
distinguishes between manipulation 
interfaces and management interfaces and 
suggests that the interface of the future will 
enable users to indirectly manage agents, 
not directly manipulate objects.® Similarly, 
Dave Smith argues that we need delegation 
interfaces which support delegation of tasks 
to the computer, not more manipulation 

interfaces.i2 

The rationale for agents and agent- 
management interfaces is that there are 
many tasks for which direct manipulation 
interfaces do not work. Such tasks are often 


tedious, they may require too much of the 
human's time, and they usually require 
processes better performed by the 
computer. Examples include searching 
news and email files for information of 
interest to a human reader,® developing 
tactics plans for an overloaded fighter pilot,® 
and exploring a planetary surface through 
robotic vehicles.® For tasks that can and 
should be performed by the computer while 
the human is doing something the computer 
cannot do, agents will be needed and an 
interface through which the human can 
manage the agents will be required. We call 
the agent management interface the 
manager's associate. 

The manager's associate is an extension of a 
concept of human-computer interaction 
derived from several years of research and 
development in a variety of domains, 
beginning with advanced pilot aiding in the 
Pilot's Associate system.^’ 3- 8 An associate 
system enables collaborative, mixed- 
initiative human-agent problem solving in 
application domains in which the human is 
unable to cope with the scale or complexity 
of the problem solving situation. In such 
domains human information processing can 
be overloaded by very large problem 
spaces, too many simultaneous activities, 
and too much data, but full automation is not 
possible because the task requires human 
perception, judgment, or expertise. The 
associate system employs models of the 
task and the user to provide advanced user 
support, including workload management, 
error recognition and correction, adaptive 
aiding, context- and user-adaptive display 
management, and selective task 
automation.2 

This paper describes a system called MAX, a 
software framework for building manager's 
associate systems. It provides an overview of 
the system and then describes its 
application in one domain: supervisory 
management of robotic vehicles. 


520 


Copy right © 1993 by the American Institute of Aeronautics and Astronautics, Inc. All rights reserved. 


The MAX Framework 

Our goal in developing MAX was to leverage 
the lessons learned, the experience, and 
the architectures developed in the Pilot's 
Associate program to build a generic system 
supporting cost-effective development of 
manager's associate systems. This generic 
system must support development of 
applications in a range of application 
domains, including but not limited to 
avionics. 

MAX is a software framework that provides an 
architectural model, a domain-independent 
software structure, tools, and reusable 
components for building domain-specific 
elements of systems used by managers to 
develop, execute, and analyze task plans for 
semi-autonomous agents. These elements 
can be employed to support problem 
assessment, focusing the user's attention 
on problems of interest, developing an 
effective problem-solving strategy, and 
executing the problem-solving strategy 
through selective task automation and 
human performance assistance. 

Manager's associate applications are defined 
in MAX as a collection of Activities, which are 
major elements of the manager's task such 
as Planning, Execution, Analysis and, for 
control of semi-autonomous vehicles, Tele- 
Operation. These activities have 
subactivities, e.g., the Planning activity can 
be composed of subactivities like Assign 
Objectives to Resources, Plan Tasks, and 
Calculate Performance Measures. Each of 
these subactivities can in turn be composed 
of lower-level subactivities. For each activity, 
MAX provides an Activity Manager, a Data 
Process Manager, an Interface Manager, and 
a Command Monitor. The Activity Manager 
provides control and coordination of system 
resources, while the Interface Manager 
provides managed information to the user 
through a graphic user interface and 
provides mechanisms for the user to interact 
with the system. Monitors maintain and 
process state information of activity-specific 
objects and domain specific data items in a 
global data store. Monitors also signal alerts 
which may be acted upon by the associate 
system or the human manager, as 
appropriate. 

A primary characteristic of manager's 
associate applications is that the human 


needs to selectively interact with large 
collections of data. These interactions 
typically include; 

• Obsen/ing data, supported in MAX by 
modeling the classes of activities the user 
is involved in, locating data relevant to 
those activities, and presenting activity- 
specific abstractions of that data. 

• Monitoring data, supported in MAX by 
data monitoring functions that apply rule- 
based criteria to identify the state of the 
data of interest to current user activities 
and to signal alerts based on this state. 

• Selecting and applying data processing 
operations. For any user activity, MAX 
data handlers can be defined to select 
and apply specific operations to selected 
data, and can specialize the selection of 
operations based on the state of the data. 

• Responding to alerts, supported by a 
mixed initiative task management 
scheme. For any alert condition, MAX can 
identify candidate responses. These 
responses may include: 

- Applying a data processing operation 

- Cueing a new user activity and its 
supporting MAX functions 

- Performing specific computations to 
generate additional options 

- Ignoring the alert condition 

MAX decides how to handle an alert 
condition by applying rule-based decision 
models to evaluate the utility of responding 
to the alert, the legality of each known 
option, the utility of each legal option, and 
the utility of automating the function for the 
user. This process allows MAX to consider a 
number of factors in deciding what options 
to present to the user, whether any should 
be suggested as the “best” action for the 
user, or whether any options should be 
automatically pursued without user 
intervention. The factors that drive this 
mixed-initiative reasoning include the priority 
of the problems the user is facing, the user's 
workload, the user's preferences for 
selecting specific options, the user's 
preferences for various levels of automation, 
and the utiiity of applying specific options to 
the problem at hand. 

The MAX framework provides a set of 
supporting software to enable and simplify 


the construction of domain-specific 
applications that exhibit these capabilities. In 
its current form, the MAX framework provides 
this support by defining three categories of 
application developer tools; 

• A set of “standard” components (and 
tools to define them) that can be used to 
construct MAX application components 
and specify their run-time interactions in a 
high level representation. These include 
“hooks” and tools for specifying domain- 
specific interface components, data 
definitions, and processing procedures. 


and “hooks” to specify user tailoring of 
the application's behavior. 

• A collection of procedural attachments to 
those components that implement high 
level specifications of control interactions 
as low-level run time control decisions. 

• A collection of “default” interface 
components. 
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Figure 1. Run-Time Anatomy of a MAX Application 


System Architecture 

MAX is centered on the concept of 
Activities. The application user, at any point 
in time, is operating in a mode comprised of a 
currently active collection of activities. Each 
activity contributes interfaces, specialized 
tools, data of interest, and data monitors to 
the user's environment while operating in 
that mode. Figure 1 illustrates the interaction 
among activities, composing and maintaining 
a unique combination of sub-activities at run 
time to respond to both the dynamically 


changing set of problems confronting the 
user and the user's own direction for 
reacting to those conditions. 

Each activity in MAX is composed of a few 
basic components, repeated as needed to 
define a complete activity. These 
components, along with their primary 
interactions, are shown in Figure 2. The top 
level MAX activity, upon activation, creates 
its interface manager and data manager. The 
interface manager in turn creates appropriate 





instances of MAX'S built-in interface objects 
(windows, menus, buttons, presentations, 
etc.) along with those domain-specific 
interface components specified in the 
activity definition. 

The activity also creates a data manager, 
which establishes data class monitors to 
detect objects of interest in the data store. 
Each object of interest found by these 
monitors (either immediately or at any time in 
the life of this activity) causes a data-event- 
handlerto be created, and a data-instance- 
monitor to be established. This instance 
monitor detects any changes in the object of 
interest, and triggers data change events. 
These in turn cause the data-event-handler 
to reevaluate the object of interest, and may 
cause it to either trigger associated data 
processing operators, or may trigger an alert 
condition. For data whose impact on the 
problem may change over time rather than 
over changes in value, a clock-driven event 
timer is provided to re-trigger data 
evaluation. 


In an alert condition, an alert is asserted. The 
alert in turn generates a set of possible 
options to deal with that alert. Options 
consist of three classes of operations; data 
processing procedures: procedures that 
generate new options; and invocation of 
sub-activities to help the user solve the 
problem. Next, the alert determines which 
options are legal in the current context, and 
then evaluates the utility of applying each 
legal option. For the best options, the alert 
also evaluates the utility of automating the 
option’s execution, as opposed to 
presenting it for the user's consideration. In 
each of these steps, “evaluating utility” 
employs rule-based decision models that 
consider problem characteristics, the user's 
current problem solving workload, the user's 
personal preferences among options and for 
his desired level of automation support, and 
models of the capabilities and current state 
of the MAX application. 
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Implementation 

MAX was developed in Macintosh'''” 
Common Lisp, version 2.01 p3 and 
Expertelligence’s Action!™, version 3.0. It 
runs on a Macintosh NCI with 20MB of RAM, 
an 80MB hard drive, and a 13 " or larger color 
or gray scale monitor. The system can run in 
other Macintosh environments, including a 
Poweitook™, with modification of the map 
displays. 


The GVMA 

MAX is designed to support a wide range of 
manager's associate applications. The 
choice of the Ground Vehicle Manager's 
Associate (GVMA) was made in collaboration 
with NASA Ames to support a successful 
demonstration of simulated planetary 
expbration using IS Robotics, Inc. robotic 
vehicles.* 

The GVMA has three main activities, the 
mission planning activity, the mission 
management activity, and the vehicle tele- 
operation and video survey activity. A robot 
simulation activity was also created to 
provide an internal vehicle simulation that 
can be executed from within the GVMA. 

Monitoring the activities of all vehicles, the 
system alerts the manager to events 
requiring human attention, provides options 
for human action, and delivers vehicle 
commands that implement options invoked 
by the manager. The manager monitors the 
sensors and can directly control each vehicle 
through the Tele-Operate activity window, 
which includes a live video feed from the 
vehicle. The overall activity of the vehicles is 
displayed in the Mission Management 
window. The GVMA observes the 
performarxie of each activity against a 
mission plan. When a situation triggers a 
nrK>nitor, the GVMA displays an alert and 
suggests actions to the manager through an 
Alert window. If the manager invokes one of 
the suggested options, the GVMA executes 
the selected action. Figure 3 shows an 
example in which an alert has signaled 
readiness to perform a search activity at a 
location on the planetary surface. At this 
point, the manager can tell the system to 


IS Robotics, Inc., Twin City Office Center, Suite 
6, 22 McGrath Highway, Somerviile, MA 02143. 


invoke the search immediately, wait for the 
start time specified in the mission plan, or 
wait for further direction. If the activity is 
invoked, the GVMA will cause the vehicle to 
proceed to the search area via the specified 
waypoints. 


Assessment 

The GVMA demonstration showed that MAX 
provides a framework for building an 
associate system for managers of multiple 
vehicle missions. We plan to apply MAX to 
the development of manager's associate 
systems in different domains in the near 
future. 

Using MAX, the GVMA was developed in two 
person-months, including interfacing to the 
robot vehicles. We did not attempt to build a 
non-MAX GVMA in order to compare 
development time and difficulty, but based 
on our experience we believe that an 
application of the complexity of the GVMA 
would have required at least six person- 
months if developed from scratch. 

Although this suggests that MAX has value 
in developing manager's associate 
applications, it is not a finished product. We 
think that it was helpful in developing the 
GVMA, but enhancements are required. 
Facilities for configuring communications 
interfaces and for developing task, 
environment, and user models work, but 
they are difficult to use. In addition, the 
software is not as robust as it should be. 

MAX supported the development of the 
GVMA, but it has not been tested in a variety 
of applications, and such testing will almost 
certainly reveal undetected bugs. Finally, 
more effort needs to be invested in 
developing planning and decision analysis 
functionality. The decision analysis module 
works, but it is fairly primitive and decision 
models are difficult to build and revise. The 
current planning module requires the user to 
develop plans using a very simple editor, 
and there is no replanning capability. 
Determining the effectiveness of the GVMA 
and other manager's associates is 
problematic. As noted by Sheridan,” the 
objective function of supervisory control is 
not fixed and cost and complexity make it 
extremely difficult to conduct controlled 
experiments in this domain. It is certainly 
possible to demonstrate the superiority of 
the GVMA over manual control, but 
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understanding the relative effectiveness of 
the many forms and combinations of 
associate features and behaviors is another 
matter. We are investigating ethnographic 
and usability inspection methodologies as 
adjuncts to the methods of experimental 
psychology. 

Summary and Future Work 

Agents and manager's associates are 
required by many important new 
applications, and we will be seeing 
commercial implementations of such 
interfaces in the near future. The first 
products will probably support information 
access applications that employ agents in 
storing, retrieving, manipulating, and 
understanding massive amounts of 
information. The management of intelligent 
devices such as remote vehicles, 
manipulators, and instrumentation will be 
another important application. These 
applications will require a manager's 
associate, and we believe that cost-effective 
development will require a framework like 
MAX. 


continue work on the system, improving it as 
we use it build new applications. First steps 
are enhancement of the planning arxl 
decision analysis functionality. We are also 
working with IS Robotics, Inc. to extend the 
GVMA and interface it to new micro-robot 
platforms. 
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Abstract 

We are investigating visually-based deictic 
primitives to be used as an elementary 
command set for general purpose navigation. 
Each deictic primitive specifies how the robot 
should move relative to a visually distinctive 
target. The system uses no prior information 
about target objects (e.g. shape and color), 
thereby insuring general navigational 
capabilities which are achieved by 
sequentially issuing these deictic primitives to 
a robot system. 

Our architecture consists of five control 
loops, each independently controlling one of 
the five rotary joints of our robot. We show 
that these control loops can be merged into a 
stable navigational system if they have the 
proper delays. We have also developed a 
simulation which we are using to define a set 
of deictic primitives which can be used to 
achieve general purpose navigation. Encoded 
in the simulated environment are positions of 
visually distinctive objects which we believe 
will make good visu^ targets. We discuss 
the current results of our simulation. 

Our deictic primitives offer an ideal solution 
for many ty^s of partially supervised robotic 
applications. Scientists could remotely 
command a planetary rover to go to a 
particular rock formation that may be 
interesting. Similarly an expert at plant 


maintenance could obtain diagnostic 
information remotely by using deictic 
primitives on a mobile platform. Moreover, 
since no object models are used in the deictic 
primitives, we could imagine that the exact 
same control software could be used for all of 
these applications. 

1. Introduction 

We are developing a robot architecture which 
uses a natural deictic interface that allows the 
user to point out targets to the system. To 
operate a deictic mobile robot, the user would 
select a target in a video image and then issue 
a command such as "approach that" or "pass 
to the right of that" where 'that' is the target 
selected in the video image. In this paper, we 
describe the robot architecture that we are 
using for this deictic system. We also 
describe our simulation environment that we 
are developing to explore the definition of a 
set of deictic primitives to be used for general 
purpose navigation. 

This work is important since the elementary 
deictic primitives give researchers a novel 
way to think about programming robot 
systems. Most robots are controlled by 
specifying a target in geometric terms, for 
example as a Cartesian position and 
orientation (e.g. 'go to 20m, 12m, and face 
10 degrees') or as a location on a map. On 
the other hand, deictic primitives would 
involve a user pointing out a sequence of 


* This work is supported by the National Science Foundation under grant number IRI-921056. This work is also 
aided by the donation of a Cojinex 4400 Machine Vision System by Ccenex Corp.. Needham, MA. 
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visual targets and the robot moving relative to 
those targets. We believe that this type of 
programming interface is more natural for 
humans since people tend to move relative to 
what they perceive. For example, we would 
'walk to the doorway' rather than 'walk 
forward 10 feet'. As our work progresses in 
the future, we will add object models so that 
our system would be able to 'approach the 
doorway'. Therefore, we believe that deictic 
commands would be a more natural method 
for people to interact with a mobile robot 
system. 

This deictic interface is vety different than 
interfaces to traditional mobile robots. Many 
robots are controlled by specifying a target 
location in geometric Cartesian coordinate 
with respect to an initial robot location. In 
this case, the robot must keep track of its 
location in order to know if it has reached the 
goal location. Other mobile robots navigate 
with respect to a map of the environment 
where goal locations are specified by a 
geometric coordinate on the map. The robot 
must continually track its position with 
respect to the map to determined if it has 
obtained its goal. Still other robots navigate 
to target objects which have pre-stored 
models so that the robot can identify 
landmarks. In all of these traditional 
approaches to interfacing with the robot, 
environmental knowledge must be encoded 
geometrically for the system to operate. 

Our deictic system is very different in that the 
robot only needs to keep track of the 
destination object in it video field. Since 
target tracking is more robust than object 
identification, the processing time of our 
system is decreased. The robot does not 
need to keep track of its location with respect 
to a global map, therefore our system is not 
susceptable to position tracking errors. We 
take advantage of movable camera systems to 
simplify our robot control architecture. 

This deictic interface for semiautonomous 
robots has many applications, especially in 
exploratory robots. Scientists can control a 
planetary rover by selecting a location of 
interest in the video screen and commanding 


the robot to go to that area. Underwater 
robots can be controlled with lower 
bandwidth communications than is typically 
necessary for remotely operated vehicles. 
Moreover, semi-autonomous robots have 
applications in aids for the handicapped. 

In this paper, we overview the robot 
architecture which uses five feedback control 
loops to control the motion of the robot. We 
show that with the time constants on the 
feedback loops that this system can provide 
smooth and stable motion of all joints of the 
robot. We also present our initial work on a 
simulator for exploring the definition of a set 
of deictic primitive commands. We show the 
results of this simulation for a series of 
approach commands. 

2. Related Work 

Developing mobile robot systems based on 
traditional computer vision and robotics 
paradigms requires the use of an a priori 
object model for the goal and a reference 
coordinate frame [16] [20]. The vision 
system identifies the goal in the scene by 
using the a priori object model provided. 
The object positions and orientations are 
peiceiv^ in the camera coordinate frame and 
must be transformed into the reference 
coordinate frame and added to the world 
model. Other sensor modules add 
information to the world model. Motion 
decisions for the robot system are made by a 
path planning module using the most recent 
information from the sensors which has been 
integrated into the world model. As the robot 
moves, the system must record and update 
the robot’s position within the world model. 
This system has been used in many robotic 
systems including [21] [11]. This traditional 
solution is somewhat limited since it assumes 
that prior object models are available, which 
is often not the case in applications such as 
planetary exploration and household robotics. 

Similar systems, for example [13], construct 
a world model without having the a priori 
object models. However, the world model 
construction process is computationally very 
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Figure 1: Robot Head. Our robot head has 
four joints. The first joint controls ''Bh, the 
pan of the head with respect to the robot base. 

The second joint controls the tilt, t, of the 
cameras. The third and fourth joints control 
the pan of the cameras 

expensive. These systems require calibration 
between the camera system and the robot, a 
localization routine so that the robot can 
identify its location with respect to the local 
map (so that the world model can be 
integrated over time), and a good kinematic 
and dynamic model of the robot system. The 
calibration, kinematic, and dynamic models 
always have associated with them some 
approximation errors. Motion planning, 
which is done on the world model, can 
become difficult as the robot modeling errors 
accumulate. 

Visual servoing techniques have been 
proposed to eliminate the geometric 
dependence of the motion commands. Rather 
than directing the robot to a destination 
location, the robot is instructed to maintain its 
visually apparent position with respect to an 
object using dynamic visual feedback. Robot 
manipulators with a camera mounted on the 
arm can now track specific objects in 3-D 
space [22] [10] and navigation systems can 
track pathways [6] [9]. These systems work 
in real-time by tracking a specific visual 
feature rather than reconstructing a complete 
3D description of the world. 


Other researchers have abandoned traditional 
methods and instead have promoted 
behavior-based robotic architectures and local 
path planning algorithms [1] [3] [4] [12] 
[19]. These systems tend to use a distributed 
computer system to acheive tightly coupled 
control loops between the sensing and 
actuation. Therefore these systems have 
better reaction times in the presence of 
moving objects. Ultrasonic sensors are a 
common choice to provide fast obstacle 
detection [2] [14]. 

Our system currently uses a simple and fast 
method for determining the motion of the 
robot and most closely resembles these 
behavior based systems. Therefore our 
system is able to react quickly to a moving or 
newly detected obstacle. We use a visual 
servoing technique to position the gaze of 
each camera directly at the target The mobile 
robot then moves in the gaze direction of the 
cameras if the pathway is clear of obstacles. 
Otherwise it moves around the obstacle and 
continues seeking the target. 

3. Mobile Robot 
Hardware 

Our experimental equipment consists of a 
mobile robot base with a ring of ultrasonic 
sensors, an active robot head, and a high 
speed video processor. The active robot head 
has four controllable motions. The robot 
head carries two cameras and controls the pan 
of each camera individually and it controls the 
tilt and pan of the pair of cameras, as shown 
in Figure 1. This platform is similar to those 
described in [5], [15], and [17]. The 
platform was construct^ such that the pan 
and tilt of the cameras occur approximately 
about the focal point of the cameras. A 
Cognex 4400 Machine Vision system is 
currently handling the real-time video 
processing of the cameras. The active camera 
head is mounted on a mobile robot platform 
with a ring of 24 ultrasonic sensors. Each 
ultrasonic sensor can determine the distance 
to the closest object in a 30° field of view. 
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4. System Architecture 

Our goal is to achieve fast, reliable pursuit of 
a target while avoiding obstacles in the path. 
Our system includes three components; a 
target tracker, obstacle detector, and mediator 
as shown in Figure 2. The target tracker 
follows the target location selected by the 
user and reports the angle and distance of the 
target to the mediator. The active robot head 
is used to simplify the target tracking task. 
The obstacle detector reports the 
measurements from the ultrasonic sensor 
ring. These measurements are the distance to 
the closest object within the field-of-view of 
each sensor as a function of angle from the 
robot. The mediator then determines the 
speed and steering angle of the robot. In the 
following subsections, we describe in more 
detail the three components of this system. 

4.1. Tracker 

The tracker is responsible for reporting the 
angle and distance to the target. Since we are 
focusing on a video interface, we will be 
using targets from video images from the 
stereo cameras. We are using stereo cameras 
to determine the distance to the target While 
determining the distance to a stationary target 
is possible from a moving platform with a 


known motion, we do not assume that the 
target is stationary nor that the motion of the 
target is known. As the robot and target are 
moving, the tracker must determine the 
location of the target in the image. Since the 
target can easily move outside of the field of 
view of the cameras, we use an active robot 
head to keep the target in sight and thus to 
simplify the tracker. 

The tracker operates as four independent 
controllers, one for each motion of the 
camera head: right camera pan, left camera 
pan, head pan and tilt (see Figure 1). The 
target is first located independently in each 
stereo image. The camera pans, 0d and 0cr, 
and the head tilt x are used to move the 
cameras such that the position of the target 
appears in the center of the stereo images. 
The head pan is independently controlled to 
try to face the cameras directly at the target. 
The angle to the target can then be directly 
measured from the pan of the robot head. 
The angles of the stereo cameras with respect 
to the robot head can be used to compute the 
distance to the target For more details of this 
controller see [7] and on video tracking [8]. 

4.2. Obstacle Detection 

The sonar system is responsible for reporting 
the locations of obstacles surrounding the 
vehicle. In a typical ultrasonic system, each 



Figure 2: System Overview. Target tracking uses the active robot head to report the direction and distance 
of the target relative to the mobile robot base. Obstacle detection reports the distance to the closest object 
within the field-of-view of each sonar sensor. The mediator pic^ the best speed and steering angle 
commands for the mobile robot base. 
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sonar covers a 30° field-of-view. The object 
which is closest within this field is detected 
by the sonar. The sonars are spaced in a ring 
around our platform. The mediator receives 
the result of each sonar individually. These 
readings can be thought of as the cost of the 
robot traversing in that direction. 

4.3. Mediator 

The mediator decides the steering and speed 
commands that will be sent to the mobile 
robot The tracker reports to the mediator the 
current direction and distance to the target. 
The obstacle detector determines a radial map 
of distances to obstacles surrounding the 
vehicle (see Figure 2). Interestingly, we 
found that the mediator need not be complex 
to steer the robot successfully. 

Consider that the robot can only steer within 
the resolution that it can sense. Therefore, to 
track the target in an image, the robot can 
steer according to the resolution of the pixels 
in the image. However, if obstacles are 
detected, the robot only knows that an 
obstacle appears within a 30° field-of-view. 
Therefore, the robot can only steer in 30° 
increments. Each ultrasonic reading 
corresponds to a steering direction. If an 
ultrasonic sensor detects an obstacle, then the 
robot should not steer into the 30° field-of- 
view of the detecting sensor. 

If there are no obstructions in the direction of 
the target, then the robot pursues the target 
direction. If there is ciurently an obstruction 
in the direction of the target, the mediator will 
select the closest open steering angle to the 
target. 

The mediator also considers the closest 
obstacle and the distance to the target when 
selecting the vehicle speed. The speed is 
inversely proportional to the distance to the 
closest object. We pursue the target to within 
a fixed distance. For safety reasons, the 
robot's speed is also clipped to a maximum 
value. 


4.4. Simulation 

To show the competence and stability of the 
system we have simulated a robot motion 
model to test our navigation algorithms. To 
ensure a realistic simulation, we have 
modeled each motion of the robot as a 
second-order system. The motion of the 
robot joints is modeled as a damped response 
to the desired motion commands issued by 
the mediator. 

At each step in our simulation, two camera 
images and 24 ultrasonic measurements are 
taken of the environment. We assume that 
these measurements are relatively accurate. 
We completely model the limited field of 
view of the cameras and the quantization of 
the camera measurements. We also add 
random noise to these measurements. The 
ultrasonic measurements also have noise 
added and we model a 30° field-of-view of 
the ultrasonic sensors. 

The simulation keeps track of the motion of 
the target and the motion and orientation of 
the robot with respect to a world coordinate 
frame. Notice that in our architecture, the 
robot does not know about a world 
coordinate frame since it has no world model. 
The robot only concentrates on pursuing the 
target location and it considers its location in 
the world irrelevant. For the purpose of 
display and sensor input computations, we 
represent locations of objects, targets, and the 
robot with respect to a world coordinate 
frame. Our simulation is two-dimensional, 
ignoring the z axis. Therefore, the tilt of the 
camera head is not simulated. 

In the following subsections, we describe the 
simulation of the camera input, the sonar 
readings, and the motion model of the robot. 

4.4.1. Camera Pan and Tilt 

Simulation 

For our simulation, we currently do not 
model projection, back projection, and 
camera measurements. Instead, we compute 
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the desired angle for the camera pans by 
transforming the position of the target to the 
camera frame. TTie transformation between 
the camera frame and the world coordinate 
frame is updated as the robot moves. 

4.4.2. Ultrasonic Measurement 

Simulation 

The obstacles in our simulation are 
represented by their comer locations. For 
each comer of an object, the position of each 
comer is transformed to the coordinate frame 
of the robot. We then compute the angle to 
this location to determine in which of the 
ultrasonic measurements this comer will 
appear. If the new distance, with additive 
noise, is less than the current minimum 
distance know by that sensor, then the sensor 
measurement is updated Given the range 
ultrasonic sensor in the ring effected by each 
object allows us to compute the intermediate 
sonar values. 

4.4.3. Motion Control 

We model each joint motion as a second- 
order system. We assume that the joint 
controller is critically damped and that the 
discrete inputs from the computer controller 
are modelled by step input functions. This 
type of motion is achieved by using a 
proportional-derivative (PD) controller. 
These PD controllers have been successful in 
controlling the vergence of stereo cameras on 
a robot platform [18]. The motion response 
to the desired input is shown in Figure 3. 
The equations of the response function is: 

0(t) = 0d(l-exp(t/x)) 

where t is reset to zero when 0^ changes. 0** 
is the desired angle of the joint that is 
computed by our joint motion algorithms 
described previously. 0d is a piecewise step 
function since it is being computed by a 
discrete controller, x is the time constant of 
the system which controls how fast the joint 
can track the desired input. We also limit the 
velocity of each joint and we insure that the 
motion of each joint stays within its range. 


Our current parameter values for the time 
constant and maximum velocity for each joint 
is summarized below: 

Xcr = 50 IcOcrlmax = 90 deg/sec 

Tcl = 50 l0)cllmax = 90 deg/sec 

Xh = 10 ItOhImax = 60 deg/sec 

Xr = 5 Iwrimax = 30 deg/sec 



4.5. Results 

We have run the simulator on numerous 
examples and we show a couple of results 
here. In all attempted scenarios, we have 
successfully arrived at the target location 
without colliding with obstacles. In the Hrst 
example, we assumed a stationaty target at 
location (10,7) with respect to the initial robot 
frame (see Figure 6.) Recall that the x 
coordinate of the robot frame specifies its 
direction of motion. Since our slowest time 
for processing a single frame was 100 
milliseconds, we used this time as the 
sampling period of the system. We assumed 
that the vehicle could travel a maximum of 3 
meters/second. 

We present a test sequence where the target is 
at the limit of the cameras' field-of-view. 
ThCTefore, the desired pan of the cameras will 
be at its largest possible value. We 
demonstrate to show that the system is stable 
and controls the head and robot motions 
smoothly even given the largest step input to 
the system. 
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Figure 4 show the motion of the left and 
right cameras with respect to time. As the 
robot begins it journey, the cameras first 
notice that the target is about 40° to the left of 
the robot. The cameras begin to pan to the 
target and the head begins to pan to face the 
cameras toward the target. The system 
normalizes when the angle of the head and 
the cameras is small. In this case, the angles 
between the left and right cameras will 
become equal in magnitude and opposite in 
sign. This occurs at about 1 second. This 
angle magnitude remains close to zero while 
the target is far away, but as the robot 
approaches the target the cameras begin to 
verge. The magnitudes of the two camera 
angles are still about equal which indicates 
that the pan of the head is still correctly facing 
the target. When the mobile robot arrives at 
the target location at about 4 1/2 seconds the 
left and right camera angles are verged at -60° 
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Figure 4: Left and Right Camera Angles. Initially 
the robot and the camera head are facing away from 
the target at about an angle of -40°. The cameras and 
pan stabilize on the stationary target location at about 
1 second. From then on the magnitudes of the 
camera angles are approximately equal. The robot 
arrives close to the target at approximately 4.5 
seconds. 


and 60® respectively. This angle can be used 
to compute the distance to the target. When 
the simulation was allowed to mn to acquire 
the target, the camera angles became -90° and 
90® respectively. 

Figure 5 shows the angle of the camera head 
over time. Confirming what we noticed in 
the camera angles, the pan motion becomes 
zero as the cameras are stabilized on the target 
location at about 1 second. Notice that when 
the cameras first observe that the target is at 
40® the robot head begins to pan to face the 
cameras toward the target. The pan of the 
head never gets all the way to 40® since the 
robot itself also turns in the direction of the 
pan. As the system stabilizes, the pan of the 
head is zero since the robot is facing the 
target. 
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Figure 5: Robot Head Angles 





Figure 6 shows the path of the robot to the 
stationary target at (10,7). The robot avoids 
a couple of obstacles that were placed close to 
the straight line path to the go^. Notice that 
the motion of the robot corresponds to 
smooth forward trajectories that would be 
possible with a nonholonomic robot that 
would be steered similarly to an automobile. 

Finally, Figure 7 show the path of the robot 
tracking a moving target. The target is 
following a circular path with a changing 
radius. The target locations, denoted by an 
'x', begin at position (10,7) and end at 
position (10.4, -4.75). The interesting thing 
is that even though the robot is not estimating 
the motion of the target, the path developed 
by the visual pursuit algorithm seems to 
anticipate the new location of the target and 
correcdy intercepts it. 

4.6. Discussion 

In our system, the motion of the camera 
head, panning the two cameras toward the 
target, is a redundant motion with the steering 
of the robot. This motion is necessary to 
allow the robot to freely manuever around 
obstacles without allowing the target to move 
outside the field-of-view of the cameras at the 
maximum camera angles. This gives the 
robot the freedom to track a target that may 
even move behind the robot. 

The architecture is very simple and provides 
for much of the navigational and path 


planning abilities necessary in the system. 
Unlike other path planning research, we are 
not focusing on singular conditions in the 
path planning (e.g. trapping in 'U' shaped 
obstacle on path to the goal.) This is because 
our system inherently has a human in the 
loop, who can select a new intermediate 
target to move the robot away for the trap. 

We discovered that the all the joint motions 
will oscillate if the response times of the 
camera pans, head pan, and robot turning are 
the same. Smooth paths were generated and 
smooth positioning of the cameras were 
obtained only if the response of the camera 
pans are faster than the response of the head 
pan which in turn is faster than the response 
of the robot. 

5. Deictic Command 
Simulation 

We have also extended our previously 
described simulation to explore the deictic 
primitives that are necessary to perform a 
general purpose navigation. Our goal is is to 
catalog a large number of environments and 
the visually interesting or trackable features 
of the environment Each environment also 
has a set of possible goal locations. Using 
this simulator, we test if the robot can 
traverse from all starting locations to all 
possible goals using deictic commands in 
reference to the visually distintive to the 
targets. 

We read polygonal environment descriptions 
from an input file. We also mark on these 
files, objects in the environment which we 
feel are easily trackable by our video system. 
We currently have descriptions of a standard 
living room and the third floor corridors of 
one of the buildings at Northeastern 
University. 

Currently, we have implemented an approach 
command where the robot directly 
approaches the target location. We show 
examples of paths taken by our robot when 
commanded to approach a sequence of 
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targets. The data depicts the corridors of the 
Northeastern University engineering building 
and we navigate to targets which we feel are 
trackable by video systems in the corridors. 
In Figure 8, we show the robot navigating in 
the corridors from just outside the elevators 
on the third floor of our Snell building to the 
doorway between Snell and Dana. The robot 
is issued three approach commands: The first 
target is the sign on a vending machine near 
the end of the first corridor. The second 
commands approaches a doorknob on the 
door at the start of the second corridor. The 
final command approaches the sign on the 
door at the end of the corridor. 

In Figure 9, the robot goes to an office in 
Snell, again from outside the elevators. The 
robot first approaches the fire alarms 
mounted on the wall to the left near the end of 
the first corridor. Then it approaches a sign 
on a door office to round the corner. A 
second alarm becomes the next target, and 
finally, the poster in the office is used to 
navigate the robot into the office. 

6. Conclusions and 
Future Work 

Our initial work on integrating an active robot 
head into a navigation scenario has been 
extremely promising. We have shown that a 
simple, 'follow your eyes' scenario is 
sufficient for tracking a moving target. In 
our situation, we do not plan extensive paths 
through the field of obstacles but we rely on a 
low resolution sonar sensor to detect obstacle 
locations. The motion of the joints on the 
robot head is smooth and can react to step 
changes in the target location. We enforce in 
our simulation a reasonable model of the 
response of the mechanical systems and the 
limitations of velocity and acceleration. 
Because of this modeling of the robot motion 
latency, the simulation produces realistic 
paths of the robot. 

We are implementing our algorithms on our 
hardware platform and intend to develop 
algorithms for obstacle detection using the 


active robot head. We wiU test this algorithm 
extensively to determine what steps we will 
need to improve the algorithm to acheive 
better performance in many environments. 
We will also begin working on vision 
algorithms that can robustly track many 
targets. We want to develop a number of 
visually directed commands useful for 
general navigation. Later, we will extend this 
work to include targets and orientation 
constraints. We hope to eventually develop a 
set of visual commands for manipulation as 
well. 

Not only does this system provide solutions 
in current semi-autonomous applications, it is 
also an alternative philosophy for developing 
fully-autonomous, general-purpose mobile 
robot systems. Many researchers are 
developing autonomous mobile robots which 
can navigate in limited situations, for example 
road-following or corridor tracking. Their 
philosophy is to merge autonomous systems 
performing specific tasks and to derive a 
general purpose autonomous system. We, 
on the other hand, are developing a robust 
mobile robot which can navigate in general 
situations. To make general mobility 
possible, our system will rely on more 
human interaction than typical mobile robot 
systems. Over time we will decrease the 
amount of user interaction by adding general 
environmental knowledge to the system 
thereby increasing the autonomy of the 
system. This will result in systems that are 
easily configmed to a number of applications 
including underwater and space exploration, 
flexible manufacturing, and robotic 
wheelchairs. 
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Figure 8: Robot path from outside elevator to the door between the Snell and Dana buildings. 
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ABSTRACT 

In this paper a methodology is presented for the generation 
of a 2-D map from an unknown navigation environment by 
traveling a short distance. The methodology proposed here 
is based on the synthesis of the knowledge extracted from 
consecutive free navigation spaces, during the movement of 
an autonomous mobile robot. The generation of the 2-D 
map of the space is classified into three cases: (a) space 
without obstacles; (b) space with standing obstacles; and (c) 
space with moving obstacles. 
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1. INTRODUCTION 

Knowledge acquisition from an imknown navigation space 
is one the challenging problem facing in the modem 
robotics (intelligent robots) and AI today [1-8]. In realistic 
situations, only the boundary and the outer shape of the 
current free navigation space is known and the interior 
stmcture and/or formation of the space are totally unknown. 
The space that we are talking about, could be the surface of 
an unknown planet, destroyed battlefield, or demolished 
landscape. Depending on the situation, traveling through the 
navigation space by human may be hazandrous, expensive, 
time consuming, inefficient and most of all may be life 
threatening. 

A technique for the generation of the 2-D space map was 
proposed by [3]. This method scans the entire area. When 
stationary obstacles exist inside the navigation space, this 
method goes around of each obstacle and at the end 
generates the complete 2-D sapce map. In case that the 
obstacles are moving in the navigation space this 
methodology does not work. Moreover, it is a time 
consuming approach, especially when the number of 
stationary objects is great. 


In this paper we present a formal methodology, which 
extracts knowledge by traveling through an unknown 
navigation space and generates a complete 2-D map of the 
navigation environment. Little or no knowledge is assumed 
regarding the navigation space and knowledge is 
accumulated solely by traveling in it. The methodology of 
generation of the 2-D map has been studied under certain 
space conditions: 

a. Space without any obstacles 

b. Space with stationary obstacles 

c. Space with moving objects 

The methodology of generating the 2-D map is based on the 
graph modeling of the navigation space and the synthesis of 
the successive free navigation spaces. Moreover, the 
proposed methodology generates the 2-D map by traveling 
a short distance in the total space. 

Ihis paper is organized into six section. Section 2 provides 
some definitions and notations. Section 3 deals with the 
representation of the knowledge extracted from the 
navigation space. Section 4 presents the synthesis of the 
shape-graphs. Section 5 discusses the generation of the 
space map and section 6 concluded the overall presentation. 


2. NOTATIONS AND DEFINITIONS 

In this section we provide a number of notations and 
definitions in order to accommodate the understanding of 
the following sections. 

Notation 1: GNS represents the global navigation space. 

Definition 1: A 2-D obstacle, b(j), jeZ.is defined as the two 
dimensional surface, Sb(j) CT GNS, where a moving object 
(robot R) cannot go through it, and the visual and laser rays 
stop or reflect on it. 

Definition 2: A robot, R(n), neZ, is a moving "obstacle” in 
GNS by using its own power to do so. 

Notation 2: BS represents the set of all the obstacles in 
GNS, 

BS C GNS. 

Notation 3: RS represents the set of all the robots in GNS, 
RS C GNS. 
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Notation 4: Sr(n) represents the 2-D surface covered by a 
lobot R(n) in GNS. 

Notation 5: E(b) = U {Sb(j)} and E(r) = {Sr(n)} represent 
the total surfaces covered by the total number of obstacles 
and the total number of robots in GNS respectively. 

Definition 3: The total free navigation space inside GNS, is 
defined as FNS = (GNS - [E(b) + E(r)]). 

Notation 7; t(i), ieZ, represents the current time. 

Notation 8: NS(t(i)) represents the navigation space at the 
time t(i). 

Definition 4: The current free navigation space FNS (t(i)) is 
defined as the free space perceived by the robot R(n) at the 
time t(i), FNS"(t(i)) C FNS. 


3. KNOWLEDGE EXTRACTION AND 
REPRESENTATION FROM THE FREE 
NAVIGATION SPACE 

In this section the extraction and represention of knowledge 
from the ^ometric form of the current free navigation 
space FNS (t(i)) perceived by a moving robot R(n) are 
presented. In particular, the extraction is related with the 
construction of the shape SH (FNS**(t(i)) of the current free 
space. Figure 1 shows graphically the generation of the 
shape. Then, the relationships among the straight line and 
curve line segments of the shape SH*\t(i)) are defined and 
the representation of the shape (knowledge) with the use of 
syntactic and semantic information is obtained by using 
directed graph with attributes [6]. 
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Figure 1 : a) Construction of the FNS shape 
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Figure 1: b) Extraction of the FNS shape 


4. SYNTHESIS OF CONSECUTIVE SHAPE-GRAPHS 

In this section, we describe the synthesis of successive 
shapes expressed in graph forms [5]. 

n n 

Two shapes SH (t(i)), SH (t(j)), with i#j can be considered 
for synthesis if the graph form of these shapes satisfy the 
following basic proposition: 

Proposition : Two shapes SH (t(i)), SH (tQ), i#j, extracted 
by the same robot R(n) at two consecutive time intervals in 
the same navigation space, can be considered as candidates 
for composition into a new shape SH"(t(ij)) if and only if 
their graph forms have at least one common node, G (t(i)) 
> N(k) = N(m) € G (t(j)) with the same properties Pr and 
the similar relationships Rs with the other nodes, where 
Pr= { size, color, length, curvature, etc } , and Rs={ connectivity, 
parallelism, symmetry, relative-distance, relative- 
magnitude,etc}. 

Starting the synthesis process, we have to search initially 
the graph form of each shape for graph-nodes that satisfy 
the proposition above. If we detect such a node in the first 
graph, then we save its characteristics and we proceed with 
the nodes of the second candidate graph. If there is at least 
such a node in the second graph, then we attempt to match 
its characteristics with the corresponding ones of the node, 
which belongs in the first graph. If there is a successful 
matching, then these particular nodes will be the starting 
point for the synthesis of the two graphs. More specifically, 
the synthesis process of two consecutive shapes (using their 
graph-fonns) is based especially on the connectivity 
relationship (angle) of the nodes with the same properties. 
Figure 2 shows graphically, the synhesis of straight line 
segments where the segment with the maximum clockwise 
angle is eliminated. The synthesis process of the rest nodes 
for these two candidate graphs is based on the detection of 
closed subgraphs and determination of their "extended" new 
subgraph forms. 




Figure 2: Synthesis of staight line segments 
taken from two consecutive shapes. 

a) Matching of the segments 

b) The new segments after synthesis 


At the end of the synthesis process, the two shapes 
generated a new shape which represents the map of two 
consecitive free navigation spaces. By repiting this synthesis 
process, finally the map of the navigation space will be 
produced. Noe there is a critical question. For how long 
does a robot have to travel in order to generate a complete 
2-D map of an unknwon space? The next section attepts to 
give some answers to the question above. 


5. GENERATION OF THE SPACE MAP BY 
TRAVELING A SHORT DISTANCE 

The generation of the space map requires the classification 
of the unknown space into three main categories: 

. Space without Obstacles 
. Space with stationary obstacles 
. Space with moving obstacles 

5.1. SPACE WITHOUT OBSTACLES 

Here the problem is to generate the 2-D map of the 
navigation space by traveling a short distance, under the 
condition that no obstacles exist inside the space. The 
solution is rather trivial if the shape of the space is regular 
geometric one, as shown in figure 3. In this case, the shape 
of the space is simple and the center of gravity (or 
geometric center) Cgis ease to be located. Thus, if a robot 
detects such a shape then it calculates the center of gravity. 
This means that the robot has to travel a distance 
d[p(x,y),Cg] from its current location p(x,y) to Cg. When 
the robot will reach the Cg then its confidence function 
takes its maximum value, thus the 2-D map can be 
generated. 



Figure 3: Some primitive geometric shapes 


In case however where the shape of the navigation space is 
not a regular geometric shape, the problem becomes more 
complex, see examples in figure 4. For these cases, we 
partition the shape of the navigation space in order to 
obtain a set of simple (primitive) geometric regular shapes, 
figure 5. Thus, for each primitive we define a center of 
gravity. At this point, all the centers of gravity are 
connected by straight line segments (if possible) and the 
robot has to reach the nearest center of gravity. From that 
center of gravity the robot will travel on the line segments 
(gravity-line), which connect the centers of gravity, by 
minimizing the traveling distance for the generation of the 
2-D space map, see figure 6. Note that the confidence 
function takes its maximum value by traveling on the 
gravity-line. It is also important to be noticed that if the 



Figure 4: Compiex geometric shapes 



Figure 5; Partitioning of a geometric shape 
into a set of primitive shapes. 



Figure 6: Connection of the centers of 
gravity. 







navigation space is partitioned into "n" regular geometric 
shapes and if all the centers of gravity are connected within 
the space, then visiting these Cg points in an optimal way 
is equivalent to finding a permutation gl,g2,g3,...gn, which 
minimizes the total traveling distance (traveling salesman 
problem). A heuristic solution is proposed here in order to 
avoid such an NP complete problem. The solution is 
coming by generating the "skeleton" (thinning) of the 
navigation space [9], see figure 7. Thus, the complexity of 
the problem is reduced by eliminating the partitioning 
process and the complexity of the connection of the centers 
of gravity. In this case, the robot has to travel on the 
skeleton line in order to generate the space map. 



Figure 7; Inregular shapes and their skeletons 


5.2. SPACE WITH STATIONARY OBSTACLES 

The solution to this particular problem is similar to the 
generation of the skeleton line. Then the skeleton line is 
converted into an equivalent graph, see figure 8. The 
generation of the complete graph is based on the synthesis 
of the current graphs generated by the movements of the 
robot. Firtsly, the robot uses its starting point as the "root" 
of the current graph G1 (see al in figure 8). Thus each 
segment of the current skeleton represents a branch of the 
graph. If the robot will be moved to a3 point, then a new 
graph G2 is generated and the new graph is synthesized 
with the pervious one for the production of a graph G3 
which represents two consecutive free navigation spaces. 



Fig. 1 




G2 G3 = G1 e G2 

^a4 

Figure 8: Graphical representation of a space 
with stationary objects, and the 
graph representation of the free 
space. 

The important feature of the G3 graph is that it has the 
ability to detect some graph nodes, which were perceived 
from the previous position. Such a case is the point a4. 
Thus this important observation plays a very significant role 
for the shorter distance to be traveled. More specifically, 
the robot has information related with the point al, a3 and 
a4, thus there is no need for it to travel on the segment 
(ala4), since it "knows" some information about the shape 
and the path related to ala4 brach of the graph. Another 
interesting point is the generation of intersected nodes. This 
means that, as the robot is moving and the new graph is 
generated, there are some locations from which the graph 
generates new braches, which intersect other braches 
constrcuted previously. The reason is that the robot can see 
areas viewed before from different angle. Thus it combines 
that knowledge for the generation of the new graph. 

5.3. SPACE WITH MOVING OBSTACLES 

In this case, which is also the most complex, the 
methodology followed is similar with the skeletonization of 
the navigation space with the only difference that the 
moving objects inside the space request one extra 
processing step. This step is the real-time detection of the 
moving objects in the each current free navigation space [6] 
and the appropriate reconstmction of the new graph. 

The complexity of this case increases significantly when the 
number of moving objects in the navigation area increases 
too. 


6. CONCLUSIONS 

In this paper, a methodology for the generation of the 2-D 
space map by traveling a short distance was presented. 
More specifically, the study of the problem was partitioned 
into three subcases, 1) space without obstacles; 2) space 
with stationary obstacles; and 3) space with moving 
obstacles. The advantage of this methodology is the ability 
to minimize the redundancy during the traveling and 
maximize the confidence function for the generation of the 
2-D map. The main disadvantage of the methodology 
proposed in this paper is the risk of generating a 2-D space 
map with some lost of information. 
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Abstract 

This paper begins with the background on how 
robotic implementation has flourished in Japan 
while only moderately growing in United States 
manufacturing. Contributing reasons are 
provided with focus on the constraint of the time 
and difficulty of robot applications software. 
Particular focus is made on the largest robot 
application segments in the world markets, 
material handling and assembly. Manufacturing 
demands of the 1990's are cited and scenarios 
of 21st century plants are described. Successful 
implementation of vision guided robots for these 
plants is described by use of the following 
system building blocks: 

1. An industry standard form factor, open 
architecture hardware control platform. 

2. A proven and integrated real time operating 
system and factory automation language within 
the platform that provides direa support for 
sophisticated motion control and rei time 
sensing. 

3. A building block, icon/menu based 
application software approach for both 
developing applications and for easy factory 
floor interface. 

4. An open architecture platform that allows 
other proven operating systems and/or cell 
control hardware/software solutions to coexist 
on the same back plane with the core 
motion/vision operating system. 

Background 

Despite robot technology being invented in the 
United States, the implementation rate in the US 
market feU behind that of the Japanese by tens 
of thousands of robots per year by the end of 
the 80's. The trend continued in 1993 despite a 
new parity in the cost of capital between US and 
Japan. New "Agile Manufacturing" thrusts 
have emerged in US manufacturing yet they are 

Copyright © 1993 American Institute of Aeronautics and 
Astronautics, Inc. All rights reserved. 


noticeably short of true flexible automation 
strategies. In the early 80’s Prudential Bache 
projected a US robot business of $2 biUion by 
19^. The market actually reached $500 million 
in the early 90's. The current installed base of 
industrial robots in Japan is 400,000 units, 
while in the US it is only 50,000 units. Japan 
installs more robots per year than the total 
installed base in the US.^ 

Why the slow adaptation of industrial robots 
and flexible automation in the US compared to 
the continuing robust growth in Japan? First of 
all. Prudential Bache was correct in identifying 
$2 billion of robot applications that should have 
been implemented in the US. In fact, it would 
have been even more had the technology been 
implemented at the Japanese rate. The 
fundamental constraint to growth of the robot 
industry was the difficulty in implementing the 
technology. Suppliers overestimated the time, 
capacity and technical skill level manufacturers 
had to implement automation. Financial 
justification followed the same parameters as 
applied to traditional hard automation, leading to 
only the most challenging applications gairung 
financial approval. ITiis only compounded the 
problem of successfully implementing the 
technology in manufacturing. 

A study by Adept Technology, Inc.2 showed 
that in the 1980’s the cost to design, write, 
debug and document application software was 
as high as 50% of the total robot system costs. 
And for the most part the software developed 
was custom to each application. Application 
software development, debug, and 
implementation was also determined to be a 
common bottleneck to system implementation 
schedules. Other robot companies and robot 
systems integrators have come to the same 
conclusions and have taken steps to standardize 
application software. In 1992 and first half 
1993 statistics released by the Robotics 
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Industries Association show healthy increases in 
the implementation of robots in the US market, 
due in part to more enabling software. 1992 
orders for US based robot manufacturers grew 
21.5%, and first half 1993 orders jumped 
40%. 1 

This paper will discuss how the utilization of 
existing and proven hardware and software 
modules which miniriiize custom software 
development cannot only increase US robot 
implementation beyond the current improved 
rates, but put the US in the ^obal lead. There 
will be no formulas or equations; manufacturing 
engineers who are our vital hope for 
competitiveness have no time for such. They 
have plenty of their own process algoridims to 
address. Iliey care about practical suggestions 
that allow them to address the cost and quality 
issues without compromising lead time to 
market That is what this paper will attempt to 
address. 


Manufacturing Forces of the 9Q's 

There are many forces influencing 
manufacturers today but three main forces stand 
out that will determine global manufacturing 
strategies: 

1. World Class Quality - World class 
quality is not an option but the price of 
admission for those manufacturers who want to 
survive the decade. The "Six Sigma" 
commitment by Motorola is one of the mote 
publicized connnitments to this imperative. 

2. Minimum Life Cycle Costs - The 90's 
is being dubbed as the "value decade", as 
manufacturers constantly evaluate their 
manufacturing costs against global competitors. 
As governments realign the world markets via 
NAfTA, GATT and EC agreements, even those 
manufacturers who limit themselves to domestic 
markets will see global competition in their 
home markets. 

3. Minimum Product to Market Times - 

The laptop computer used to write this paper is 
currently nine months old and has already been 
obsoleted by a higher performing model. 


Conventional high volume, low mix production 
strategies are in conflict with this force. 
Manufacturers are being challenged to have 
shorter changeover times or miming multiple 
products simultaneously on the same line. One 
power supply manufacturer has an objective that 
a customer can specify a custom power supply 
with a lot size of one and receive it within one 
day, built with flexible automation. 

Unfortunately, the above three forces have 
traditionally b^n in conflict with each other. 
Automation might have addressed quality forces 
but did not support rapid lead time to markets. 
Traditional robot automation may not have met 
cost objectives. And many US companies 
found Aat offshore labor did not meet quality 
requirements. 


Short Product Cydw 



The RIA and its member companies believe that 
there does not have to be a tradeoff among these 
forces. The belief is that properly applied, 
"Agile Automation" can "shrink' the triangle of 
tradeoffs. 



Adept Technology, Inc has developed a long 
range strategy for minimizing flexible 
automation ceU implementation and changeover 
time with the development of vision guided 
flexible feeding technologies and modular 
functional and application software. These 
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Rapid Deployment Automation 
Objective: Conservation of Time 


Years Months Weeks Days Hours Min. Sec. 


System Implementation 
Specifications 
Programming 
Editing 
Debug 

Product Changeover 


System MTTR 
Diagnostics 



efforts are part of Adept's overall vision of 
"Rapid Deployment Automation". 

Rapid Deployment Automation is an overall 
strategy focused at reducing the time, and 
thereby the cost, required to implement flexible 
automation. Given that systems typically cost 
2x the direct hardware costs. Adept feels 
strongly that reducing engineering content will 
be a primary driver to reducing the cost of 
flexible automation. Rapid Deployment 
Automation also address directly the product 
changeover / time to market issues discussed 
above. 

Flexible feeding is a key element in Rapid 
Deployment Automation, as by nature it reduces 
the amount of hard tooling in a robot cell, which 
reduces costs and promotes rapid development 
of cells.^ Flexible feeding replaces traditional 
part specific feeding systems such as bowl 
feeders and precision dunnage with intelligent 
sensor b^d robotic application software. 
Seiising is provided by integrated machine 
vision and real time force sensing, which are 
used to locate random or loosely oriented parts 
on smple conveying or infeed systems. 

Flexible Feeding software includes modules and 
algorithm’s to locate the parts, dete rmine their 
orientation, control the conveyors and 
recirculating devices, and acquire and place the 


parts. The Rexible Feeding concept, and the 
specific software modules 4at drive it, ate 
examples of how software can dramatically 
simplify the overall engineering task in a flexible 
automation cell. 

21st Centu ry Factories 

If Adept and other RIA companies are 
successful, the following scenario is possible 
for the year 2000 manufacturing assembly 
plants: 

1. Small, nimble plants, less than 250 
employees, located close to consumer bases of 
population or major OEM customers. 

2. Lot size of one, same day delivery to 
customers. Rapid changeover from one part to 
another and rapid implementation of incremental 
process and component part improvements and 
changes. 

3. Vision guided robot and flexible feeder based 
production. Limited investment in hard tooling. 
Intelligent software based feeding and fixturing. 

4. No software language based progr ammin g 
^thin factories. Simplified set up of highly 
intelligent software modules with imbedded 
process knowledge by floor personnel. 
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One way we will not meet the challenge is to 
continue to develop application software and/or 
motion control platforms "from scratch" because 
of what appears to be unique or process specific 
requirements. The development cycles for 
applications will exceed the product cycles 
themselves. The approach certain robot and 
systems suppliers are taking is to provide 
function and application software modules that 
serve as building blocks to the final line 
application program. 

This modular approach to software development 
is not new. There ate countless libraries today 
of software functions from various hardware 
and software vendors. The burden in using 
modular libraries is in the linkage between 
modules. This fine tuning still requires highly 
skilled programmers, and significant time. The 
new approach is to provide tiiese modules 
within a structure or framework that defines 
how the modules work together. Unlike object 
oriented programming, which offers a similar 
strategy, this new generation of factory 
automation software includes high level 
interfaces and process knowledge about the 
tasks themselves to allow setup by non 
programmers.'^ 

The best way to describe such an approach is 
Avith an application. Assume we have a dental 
adhesive applicator product with a cylindrical 
body with one open end, an impeller that must 
insert into the body, and a cap that must be 
inserted at the top of the body and impeller. The 
body arrives open end up in multiples on a tray 
moving on a roller conveyor and stops against 
an activated fixture. The durmage that carries 
the body is general purpose, and does not 
provide sigruficant precision in locating the 
bodies. The manufacturer is dealing with a 
product with a short life cycle and wants to 
quickly reuse the robot, grippers, feeders, and 
software when any of the tluee parts change in 
design. A single vision guided flexible feeder 
for Ae impellers and caps, and vision guided 
registration of the bodies in the trays along with 
inspection for acceptable inner body diameters 
are among the cell design parameters. 


The long path to a solution would be to interface 
one supplier's vision system and language with 
a second motion system and language, with 
custom software written in C, pe±aps on a PC 
platform. Not only is the development cycle 
long, but history has shown debug activity for 
this reproach to extend well past floor 
implementation. 

This potentially ill fated path relates to a lack of 
empathy of many developers with regard to 
"red time" in a robot cell on the factory floor vs. 
that in the work station environment. 
Deterministic, multitasking operating systems 
that can do context switching in micro seconds 
arc crucial when expensive grippers with 
expensive parts are on a collision path. More 
important Aan part and gripper damage, system 
downtime is unaffordable. Manufacturers like 
Toyota and Michelin are requiring 40,000 hour 
MTBF performance fiom equipment suppliers. 
In other words, bugs or errors at the work 
station are manageable, but unacceptable on the 
assembly floor. 

The integrated control platform ofters a much 
lower cost, schedule time, and risk approach to 
this solution. A platform that has thousands 
successfully runiung applications while keeping 
motion, vision and force sensing, and 
communications software under one language is 
the optimum place from which to start for tlus 
application.^ 
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An Adept control platform that control's Adept 
robots or any other robot mechanism would be 
made up of "wrung out’‘functional software 
modules that build into an overall applications 
module. These functional modules are menu 
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driven and u tiliz e a common data base structure 
and common global variables. Modules are tied 
together to create the specific application 
module, imbedding any necessary process 
knowledge, by use of die high level V+ 
programming language. Adept's standard 
functional modules and tools for vision and 
force guided assembly are represented by icons. 

Adept's patented AIM *“ (Assembly 
Information Manager) environment allows the 
developer to create additional linked icons as 
required to represent combinations of Adept 
modules and any application customization or 
process imbedding. 



I?ifl*/J<s|^t ^«s(^'' -fifeecf' »«S^ar^4f/f tiiCccfel 
















So the impeller assembly application can be 
quicldy developed with bug free software 
modules which also leave an rational user 
interface for modifying on the factory floor. 
Now let's assume the manufacturer has strict 
FDA requirements for Statistical Process 
Control records and has invested heavily in a 
PC based SPC program that he wants to apply 
to any workstation. The Adept MV controller 
contains a standard VME back plane with open 
slots for other boards. The manufacturer can 
have a VME board with an imbedded PC 
inserted on the Adept back plane and 
communicate as required with the Adept 
operating system and AIM. Customization is 


limited to defining what variables are needed by 
the SPC module. 


Summary 

The approach described here is simple: Use 
commercially available software and hardware 
wherever possible to minimize system costs, 
schedules, and risks. There is an abundance of 
software and hardware technology that does not 
have to be reinvented. This approach is what 
put the Japanese in a leadership role in the 80's. 
Ihe US now can do the same with a much more 
flexible array of vision guided robot 
technologies. 
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Abstract 


Commercial controllers for robots are typically 
custom-designed with closed architectures on 
proprietary hardware and software platforms. 
However, the cost of open controllers that use 
standard computer hardware and software 
platforms is rapidly decreasing. It is now 
practical to build an open controller for 
sophisticated robot and general motion control 
using off-the-shelf components. Such open 
controller designs allow the user to standardize 
on hardware platforms such as VME, and on 
operating systems and user interfaces, such as 
UNIX or Windows. This paper describes 
Nomad, an open architecture motion controller. 
Nomad consists of a set of software modules 
designed to control robots, various specialty 
machines, and machine tools. The base 
operating system for Nomad is LynxOS, a 
POSIX-compliant real-time UNIX system. 
LynxOS, and hence Nomad, runs on a munber 
of hardware platforms, including PC-ATs, 
VME-based PCs, Motorola and RISC 
processors. Nomad provides for sensor- 
controlled robotic motions, with user replaceable 
kinematics. It is programmable in C, with lull 
UNIX compatibility, including X Windows and 
MOTIF. Specialized programming interfaces 
and languages have been added. Open 
architecture controllers, as represented by 
Nomad, will have a major impact on the robot 
control industry. 


I Introduction 

! 

Typically, motion controllers for robots and 
' other machines have been developed based on 

i closed architectures and proprietary platforms. 

, The hardware, the operating system, and the 

i 


software were custom designed. The result was a 
closed system that was inflexible and expensive 
to develop and maintain. In addition, closed 
architecture controllers could not take advantage 
of the rapid improvements in cost and 
performance driven by high volume markets 
outside the motion control industry. Also, 
standardization on common platforms by 
customers and end users was impossible. 

In the past, the advantages of such custom 
architectures have been cost and performance. 
However, the cost of off-the-shelf standard 
computer hardware has dropped dramatically 
and performance has increased substantially in 
the recent past. The result of these rapid 
advances is that it has now become feasible to 
build sophisticated robot and general motion 
controllers using off-the-shelf components. 

These compare favorably in price and 
performance to custom designs. 

If standard real-time operating systems are also 
used in their designs, then open architecture 
controllers will automatically leverage future 
advances in hardware, driven by R&D funding 
in high volume consumer markets. That is, such 
open designs will become more and more 
attractive in price and performance over time. 

In addition to attractive cost and performance, 
open controller designs allow the customer to 
standardize throughout his factory on platforms 
such as VME, and on operating systems and 
user interfaces, such as UNIX or Windows. This 
is an attractive advantage not possible with the 
various custom designs. Also, such controllers 
can be tailored exactly to the requirements of the 
customer's application and the machine being 
controlled. This is expensive or impossible with 
closed custom architectures. 


I 
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With Nomad, Trellis Software & Controls, Inc. 
is introducing open architecture software for 
motion control. Nomad software modules can be 
combined with standard off-the-shelf computer 
hardware and software to build robot or other 
motion controllers that are standardized and yet 
tailorable exactly to specific applications and 
machines. 


Nomad Overview 

Nomad was designed to be the foundation for 
open architecture motion controllers for robots, 
various specialty machines, and machine tools. 

The base operating system for Nomad is 
LynxOS, a POSIX-compliant real-time UNIX 
system. LynxOS, and hence Nomad, runs on a 
number of hardware platforms, including PC- 
ATs, VME-based PCs, Motorola and RISC 
processors. Nomad-based controllers will 
automatically benefit from future advances in 
functionality and pricing of these hardware 
platforms. 

A controller built with Nomad software modules 
and off-the-shelf standard computer components 
provides full power of sensor-controlled robotic 
motions, with user replaceable kinematics. It is 
programmable in C, with fiill UNIX 
compatibility, including XWindows and 
MOTIF. Specialized programming interfaces or 
languages can also be added. 

Nomad-based motion controllers can be 
uniquely tailored to specific machines and 
applications. Through the power of C, X 
Windows, and MOTIF, tailored application 
packages can be developed, resulting in 
controllers that are very simple to use from the 
end user's perspective. These application 
packages can be menu-based or teach pendant 
programmable or whatever is appropriate in 
each instance. With application-specific 
interfaces, the end user need not be faced with 
having to learn the intricacies of an operating 
system or a programming language. 


Nomad Components 

Nomad components include TMOS, a Cartesian 
trajectory generator that provides the full power 


of sensor-controlled motions and a variety of 
industrial servo and I/O interfaces. The 
trajectory generator allows for six degree-of- 
freedom Cartesian position offsets in real time 
and coordination with multiple auxiliary axes. 
Kinematic solutions for different machine 
configurations can be incorporated into TMOS. 
The I/O control provides precise 
synchronization of both discrete digital and 
analog I/O with motion. 

C-WORKS, the second component of Nomad, 
provides the user environment for Nomad. It 
consists of a C library for communicating with 
TMOS, a system configuration tool for Nomad, 
control panel functions, and a set of 
demonstration programs. 

A wide variety of optional Utilities represent the 
third Nomad component. These utilities include 
a graphical servo tuning tool, a graphical 
machine simulator, a conunand line interface to 
Nomad, teach pendant support, and others. 

Nomad was designed to be open and modular. 
This allows Trellis to provide additional 
combinations of user environments with other 
motion systems in the future. For example, other 
user environments planned consist of a robot 
language and an NC environment for Nomad. 

The remainder of this paper will cover TMOS, 
the Nomad trajectory generator. 

TMOS Interface to Nomad 

This section describes the High Level C Library 
(HLCL) interface to TMOS. The HLCL provides 
many conversion functions for various forms of 
user data, performs integrity checks on user 
supplied data, provides parameter schedules to 
simplify motion programming, and hides 
internal TMOS data structures from the C 
program, minimizing the need for program 
modifications with TMOS product 
enhancements. 

In addition to advanced motion commands, the 
HLCL interface to TMOS contains a number of 
imique features that allow programmers to 
communicate position information in many 
different formats, use "schedules” to reduce 
complexity of simple applications, generate 



Position Allocation and Data Types 


complex trajectories with minimum effort, 
program sensor-guided motions, generate off- 
line paths, and provide user control of error 
handling. 


Connecting to TMOS 

Figure 1 illustrates the interaction between a C 
program using the HLCL and TMOS. 


Languages for motion control typically provide 
one or more specific data types for position data 
which can be declared and allocated within a 
program. These types must then be supported in 
communications with the outside world through 
files or networks. The inevitable problem is one 
of compatibility with off-line generated data or 
with new and improved releases of software. 


Some HLCL function calls work synchronously 
with TMOS by sending it a message and waiting 
for a response before returning to the user 
program. Some functions, such as a request to 
set a digital output, require no response and 
return inunediately after sending the message. 
Finally, some functions, such as status requests, 
interact with an information base that TMOS 
updates. 

Motions generally execute in parallel with the 
user program. That is, HLCL calls that initiate 
motion will return when the motion is queued. 
When motions complete, a message is sent back 
to the user process. A user definable callback 
function is then used to process the completion. 

Multiple user programs may login to TMOS. 
Since the information described above is kept 
within the space of each user process, each 
process has its own context and will not 
interfere with other programs that might use or 
modify the same parameters. 


The HLCL for TMOS allocates position data 
internally in an undocumented format called a 
TMOSPos. HLCL calls can be used to convert 
between a variety of user formats and a 
TMOSPos. TMOS will then return a handle to 
the TMOSPos for later use in motion and other 
calls. The burden of storage and communication 
of the user's data is left with the user. 

For example, an Euler format for position data 
can be declared and allocated in the user's C 
program. He can store and retrieve such data 
using files or the network. He would then call a 
TMOS routine to convert that data to a 
TMOSPos and return a handle to the user. 

Later, the user's C program can issue a motion 
call to TMOS using the handle. 

In this way, issues of upward compatibility with 
future versions of TMOS are avoided, and the 
user is free to archive his data in whatever 
format and precision he chooses. (That is, the 
format of a TMOSPos is not intended to be 
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secret, but merely to shelter programs from 
changes in its structure with future 
enhancements.) 

In addition to several other pieces of data, a 
TMOSPos keeps configuration information, so 
that when such a position becomes the argument 
of a machine motion, redundant solutions for the 
position can be reconciled. HLCL calls that 
return position information will also return 
configuration information. The user may keep 
this configuration information with the data or 
choose to ignore it. 


Schedules and Other Modal Parameters 

Sophisticated motion control algorithms employ 
many user or system definable parameters (as 
many as 100). It is inconvenient for the user to 
have to specify every parameter with each 
motion request. This problem is typically solved 
with modal parameters or system-wide 
parameters. 

In a multiprogramming environment for 
multiple machines, we must further consider 
whether a parameter is specific to a thread of 
execution or whether it is specific to a particular 
machine. For example, it is appropriate for each 
separate program to keeps its own default value 
of speed to be used with every motion request. 
However, since the tool definition refers to a 
current physical attribute of a machine, that 
characteristic must be shared globally with all 
programs. 

A TMOS schedule includes the modal 
parameters of speed, acceleration, motion type 
(linear, joint, etc.), and motion termination type. 

Other modal parameters used in TMOS include 
a base frame of reference definition (called 
Frame) and a tool frame of reference definition 
(called Too[). These modal parameters are 
defined inside the TMOS process itself rather 
than in a schedule, because they are specific to a 
particular machine rather than a thread of 
execution. Tool and Frame are set with specific 
HLCL function calls. 


Coordinate Systems and Coordinate Frames 

All Cartesian motions produced by TMOS are 
defined for a specific Tool Center Point (defined 
by a homogeneous Tool transformation) and are 
defined with respect to a specific user definable 
frame of reference (defined by a homogeneous 
Frame transformation). Tool and Frame 
transformations are kept modally within TMOS 
for each machine under its control. That is. Tool 
and Frame need to be set only once, and those 
values will be used for each Cartesian motion of 
the machine until the modal values are changed. 
Changes in Tool and Frame take effect only at 
the beginning of the next motion using Tool 
and/or Frame. 

Dynamic offsets to both transformations can 
occur at any time during a motion that uses 
DeltaTool and/or DeltaFrame. Changes to 
either of these transformations will take place 
immediately after the HLCL function calls that 
change them. 


Interpolation and Termination of Motions 

TMOS trajectories are broken into two 
categories, those that are terminated at a user 
specified destination (called destination 
terminated) and those that are unterminated 
(called vector). Vector motions are terminated 
by a succeeding motion or by an HLCL function 
call. Vector motions are useful for user defined 
sensor termination (also necessary for manual 
motion implementation.) 

For Cartesian motions, the trajectories are of the 
Tool Center Point (TCP). In some systems, the 
destination position is checked for reachability 
of the tool center point before the motion is 
attempted. However, for many machines with 
nonlinear kinematics, this does not guarantee 
reachability of all positions on the trajectory. 
Therefore, TMOS does not check for 
reachability of destinations. TMOS confines 
itself to dynamic testing of reachability of each 
position on the desired trajectory for both vector 
and destination terminated motions. The test is 
made one deceleration period ahead of the 
machine's current position on the trajectory, so 
that the machine can be stopped on the 
trajectory just prior to the offending position. 


The vector motions continue forever until 
stopped by a joint limit or are canceled or 
aborted. Many types of vector motion are 
available in TMOS, including move-to-joint- 
vector, move-to-world-vector, move-to-frame- 
vector, move-tO“tool-vector, and move-to-wrist- 
joint-vector. 

Destination-terminated motions are completed 
when the destination position of the motion is 
reached within the tolerance indicated by the 
termination condition. Motion types include 
joint-interpolated moves, straight line 
interpolated moves of the tool center point 
(TCP), circularly interpolated move of the TCP, 
straight line interpolation of the major axes in 
combination with joint interpolation of the wrist 
axes, and other more complex motion types. 

All the above motions can be dynamically offset 
by various sensory inputs, such as visioa 


Termination Conditions 

The terminating condition of a motion for any of 
the above interpolation types determines how 
closely the machine must come to its destination 
before returning to the calling subroutine. 

TMOS provides a number of termination 
conditions, including: 

• Return motion complete when the machine 
is within tolerance specified as Fine or as 
Coarse in the TMOS configuration. 

• Return motion complete and start next 
motion when interpolation of this motion is 
complete (do not wait for servo tolerance). 

• Start next motion when this motion begins 
deceleration. This provides the ability to 
blend motions. 

• Initiate a new motion immediately when the 
next motion instruction is received. This is 
useful for vector motions in manual motion 
pendant implementation and user defined 
sensor applications. 

• Other more complex termination types 
unique to TMOS, such as fillet termination 
which permits continuous motion at 


constant speed across motion segment 
boundaries. 


Real-time Trajectory Modification 

TMOS permits real-time modification of 
motions in progress. Routines are provided 
which accept a TMOS position as an 
incremental offset to either the part {Frame) or 
the tool position. This feature can be used to 
implement sensor-guided tracking for example. 
The C program can read the sensor in a loop 
and dynamically modify the motion in progress. 


Arm Configurations 

Since robots can generally reach a given 
Cartesian position in more than one way, 
Cartesian information alone is insufficient to 
completely determine the joint angles needed for 
a machine to reach that position. For example, 
the PUMA robot can generally reach a position 
with its wrist either above or below the elbow. 
Some redundant manipulators can reach nearly 
every position in an infinite number of ways. 

The additional information needed to dictate 
how a machine will reach a given position is 
referred to as configuration information. Some 
controllers use specific commands to specify the 
configuration to be used in reaching a position. 
TMOS assumes configuration is part of the data. 
Therefore, all library routines that expect 
Cartesian input data also provide a parameter 
for configuration information. 

For some simple machines, Cartesian positions 
can only be achieved in one way. Also, it is 
desirable to represent the positions of some 
objects without regard to how a machine might 
reach that object. Therefore, the Euler and 
Transform types defined for user representation 
of positions by the HLCL do not incorporate 
configuration information inside the data. 
Configuration data may be optionally added to 
these data types when they are converted to 
TMOS positions by the TMOS conversion 
functions, or when they are converted to 
machine joint angles. 


Motion Commands and Continuous Motion 


TMOS by default provides coordinated motion 
of all axes of a machine on every motion. This 
means that for each motion request TMOS will 
coordinate the motion such that each joint of a 
machine will begin and end its motion at the 
same instant in time. The meaning of speed 
therefore, must apply not to individual axes, but 
to some entity which represents a combination 
of the axes of a machine. In some cases, speed 
applies to the tool center point with respect to 
the Frame. In some cases it is not possible to 
define a single point at which speed can be 
measured (joint interpolated motion for 
example) and speed is defined relative to some 
maximum. In addition, for motions which 
involve changes in both orientation and 
translation of the tool, both rotation and 
translation speed constraints must be taken into 
account. 

TMOS considers up to three kinds of speed in 
the coordination of a motion, depending on the 
motion type: 

• Tool translation speed - the speed of 
translation of the defined tool center point 
relative to the specified Frame. 

• Tool rotation speed - the speed of rotation of 
one or more angles of orientation, measured 
in rotation units per second, 

• Axis speed - the speed of motion of an axis 
(either rotation or translation), measured 
relative to the maximum for that axis. 


The HLCL is designed so that routines return as 
soon as possible. In the case of motion requests, 
the routines will return as soon as an "ID" can 
be assigned to the motion. The calling program 
can then continue processing other events and 
handle motion completion as an asynchronous 
event. If the programmer wishes to issue a series 
of motion requests, which is often the case, he 
must wait for previous motions to complete. A 
library routine is provided, which permits the 
caller to wait for the completion of a specific 
motion, identified by the ID. 

Of course, the programmer can also cause 
continuous motion, by using a suitable 
termination type, so that the machine will not 
decelerate at the end of each motion and not 
wait for the completion of the previous motion. 
Continuous motion with the appropriate 
termination type will cause succeeding motions 
to be smoothly blended with previous ones. 
Smooth blending of motions at constant speed 
of the TCP is a unique option in TMOS. 


Summary 

TMOS, when combined with C-WORKS and 
other utilities, forms Nomad, a foundation for 
open architecture motion control for robots and 
other machines. Nomad software runs in a 
UNIX environment on off-the-shelf hardware, to 
provide low cost and high functionality motion 
control, standardized and yet amenable to 
tailoring for highly specialized applications. 


Sneed Limits 

TMOS imposes speed limits only on a joint 
basis. That is, TMOS continuously monitors the 
speed of all axes. If any axis exceeds its speed 
limit as defined in the TMOS configuration file, 
then all axes are scaled back to maintain 
coordinated motion at the limiting joint speed. 
No limit is imposed on Cartesian translation or 
Cartesian orientation speed control. 

TMOS also permits acceleration control on a 
per-motion basis. 
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Abstract 

The time is right to transition telerobotics beyond 
the traditional hazardous environment domain into 
industrial repair and remanufacturing applications. 
Air Force depots are prime examples of an in- 
dustrial environment where small batch sizes, fea- 
ture uncertainty, and varying workload, conspired to 
make classical industrial robotic solutions impracti- 
cal and telerobotics a key enabling technology. The 
AFMC Robotics and Automation Center of Excel- 
lence (RACE) has launched the Unified Telerobotics 
Architecture Project (UTAP) to champion the de- 
velopment of the support infrastructure necessary to 
foster creative development and innovative utiliza- 
tion of emerging telerobotic technologies for depot 
applications. The objective of this paper is to demon- 
strate that telerobotics is a viable solution to a wide 
range of dual use applications, highlight the benefits 
from a unified approach, and provide an overview of 
the UTAP. 

1 Introduction 

The United States Air Force has five major Air Lo- 
gistic Centers (ALC), or depots, thdi perform peri- 
odic weapon system maintenance. A significant por- 
tion of the periodic maintenance workload involves 
repair and remanufacturing. The small batch sizes, 
feature uncertainty, and varying workload that char- 
acterize the depot remanufacturing environment con- 
spire to make classical industrial robotic solutions 

°This paper is declared a work of the U.S. Government and 
is not subject to copyright protection in the United States 


impractical for a wide range of depot processes. The 
robotics and artificial intelligence necessary to solve 
those problems with a completely automated system 
is beyond our grasp technically and economically. 
An equally demanding constraint is applied by a de- 
pot level workforce resistant to complete automation, 
and a management structure soured by the unfulfilled 
hyperbole of past robotics projects. But the require- 
ment for robotic/automation based solutions is grow- 
ing. New processes that are environmentally safe, 
but too demanding for human operators, the need 
for increased process consistency with lower manu- 
facturing tolerances, and competition with industry 
aU point to a larger role for judicious application of 
advanced robotics technology. The critical missing 
element is a method to bridge the gap, both cultur- 
ally and technically, between manual operation and 
full automation. Telerobotics provides the means for 
building that bridge. 

We broadly define telerobotics as the technologies 
and systems that permit a human operator to direct 
and/or supervise the operation of a remote robotic 
effector mechanism [1, 6]. Telerobotics does not im- 
ply a particular solution, but rather encompasses the 
whole range of application driven solutions ranging 
from telepresence to supervisory control. The key 
premise is to augment, not replace, the human oper- 
ator by blending the individual abilities of each sys- 
tem, Humans have superior cognitive and pattern 
recognition skills, while the robot is a tireless pre- 
cise positioning system. The telerobotic system is 
not a threat to job security, but rather a new inno- 
vative tool that adapts to the operator to maximize 
productivity. 
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Unfortunately, telerobotics is more of a concept 
than an off-the-shelf technology. The basic compo- 
nents are available, and prototypes exist in various 
forms in numerous laboratories. However, develop- 
mental efforts have been targeted toward undersea, 
space and nuclear material handling applications. 
Solutions tend toward point designs customized to 
the particular application. Development of low cost 
systems was not a priority. Viewing the existing 
telerobotics market as a small ni( li^\ the major robot 
vendors have been reluctant to oxfxMid the resources 
necessary to modify their control systems to sup- 
port a broad range of telerobotic solutions. Con- 
sequently, the manufacturing sector has been slow 
to embrace telerobotics and efforts to transition the 
technology from the laboratory to the shop floor are 
in their infancy. Therefore, we are presented with the 
unique and compelling opportunity to significantly 
influence the development of an emerging technology 
with the potential to radically enhance the produc- 
tivity of the depot, and industrial, remanufacturing 
processes. The challenge is to implement the lessons 
learned from the mistakes of the past, to change our 
robotics technology insertion philosophy. Instead of 
developing one-o/ systems, we must embrace the cre- 
ation of a unified systems concept that supports a 
large range of applications, provides an evolutionary 
path for incorporating new technologies, and reduces 
life cycle costs. 

The Air Force Materiel Command Robotics and 
Automation Center of Excellence is championing the 
development of a unified framework or infrastructure 
that supports judicious insertion of telerobotics tech- 
nology. The intent of this paper is threefold. First 
we present the case for telerobotics as a key enabling 
technology for depot process ranging from large air- 
craft paint stripping to surface finishing of compo- 
nent parts. Section three highlights the benefits of 
utilizing an unified architecture (infrastructure) to 
implement process solutions. In section four, our ef- 
forts to make that unified infrastructure a reality via 
the Unified Telerobotic Architecture Project (UTAP) 
are discussed. Conclusions are in section five. 

2 Why telerobotics? 

The best way to present the case for telerobotics 


for depot modernization is to overview the require- 
ments for several target applications. Previous pa- 
pers have presented detailed discussions of the teler- 
obotic solutions to aircraft skin repair and fuel tank 
sealing/ desealing [4, 5]. The remainder of this sec- 
tion is devoted to overviews of two processes targeted 
for prototype development under the UTAP. Specific 
process requirements (angle of incidence, standoff, 
accuracy) are in [6]. 

2.1 Aircraft Corrosion Control 

At predefined intervals, aircraft are flown to the de- 
pots where existing paint is removed to allow surface 
inspection and repair of any corrosion damage. Be- 
fore returning to active service, corrosion inhibitors 
are applied and the airframe is repainted. The pro- 
ductivity of all three processes; stripping, inspection, 
and painting can be improved by insertion of teler- 
obotic systems. Paint removal is the initial target 
application in this area. 

Process engineers responsible for corrosion con- 
trol are being drawn to robotic systems due to ef- 
forts to eradicate the chemical stripping processes. 
Alternative paint removal techniques, while not en- 
vironmentally hazardous, can be unsuitable for hu- 
man application. High pressure (18K psi) water jet, 
C02 ice pellets, flash lamps and lasers based appli- 
cation tools must be mounted as robot end-effectors. 
Even ignoring the obvious physical dangers, the ap- 
plication tools are too heavy (50 lbs or greater) for 
continuous human operation. Plastic Media Bead 
(PMB) and sodium bicarbonate blasting can be per- 
formed by operators in special air breathing suits, 
but the task is monotonous and messy. Another au- 
tomation driver is the desire for stringent processes 
control. Many of the alternative stripping methods 
remove paint by blasting the aircraft or part with 
some media. Blasting introduces stress into the sur- 
face leading to reduced fatigue life. Tight control of 
the blasting process is necessary to minimize those 
side effects. Robotic systems provide a level of pro- 
cess control superior to that of a human operator. 
The unfriendly application environment, heavy pay- 
load, repetitive non-contact task nature of the task, 
and requirement for tight process control make the 
paint stripping operation ideally suited for robotic 
intervention. 
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The USAF has sponsored the development of 
large robotic paint stripping systems at three ALCs. 
Southwest Research Institute (SwRI) developed a 
custom system that is being used to strip F-16 air- 
craft with PMB and is being retrofitted for C02 
blasting of F-15s [7]. The Large Aircraft Robotic 
Paint Stripper (LARPS) REPTECH project is a 
large SCARA arm riding up and down on a col- 
umn attached to an automated guide vehicle (AGV) 
[3]. The end-effector is a new commercial robot us- 
ing a high pressure water process. Both SwRI and 
LARPS are big (50K lbs), expensive (>$2M), fuUy 
automated systems. But those processes are part of 
a large overall process that does not lend itself well to 
automation, ie the masking and general preparation 
of the aircraft for painting/stripping. At least half of 
the total process remains very manpower intensive. 
Add in the fact that several installations already have 
stacker (telecrane) phdforms that allow human oper- 
ators to access large portions of the aircraft surface 
and one can make a compelling argument for aug- 
menting the existing workforce instead of replacing 
it. 

A telerobotic aircraft paint removal scenario would 
look like the following. Attach a small robotic end- 
effector to the underside of the telecrane. The op- 
erator manually drives the stacker crane into the 
proper stripping position and then uses a joy-lick 
and the robots force sensing capability to register 
the actual worksite to a predetermined stripping tra- 
jectory. After setting stripping and other application 
parameters the operator becomes a supervisor as the 
system autonomously executes the stripping process. 
To perform the process the system must maintain a 
stripping process dependent separation/standoff dis- 
tance and a tooling angle of incidence to the work- 
piece normal. While the primary mode of opera- 
tion is supervisory, the system shall support a shared 
control feature that slaves end-effector position to 
the joystick with the system automatically regulat- 
ing standoff and angle of incidence so that the oper- 
ator can quickly remove any excess paint left by the 
autonomous process. 

The robotic system is not responsible for aU paint 
removal. The human operators, necessary for the 
preparation, would stiU be utilized to strip hard to 
reach locations. But the new tools free the opera- 


tor from directly applying the stripping process to 
over 80% of the aircraft while dramatically improv- 
ing process control. A properly designed telerobot 
system will support aU stripping processes. Switch- 
ing to painting and inspection tasks only requires 
some quick change tooling. Attached to mobile lift 
platforms the same system could perform flight line 
touch-up, or cross over to dual use applications like 
highway bridge repair. 

2.2 Surface finishing 

The standard procedure for repairing dents in en- 
gine nacelles is to fill the indentation with a fiber- 
glass epoxy compound and then finish the surface 
to the required smoothness. Repair of aluminum- 
honeycomb aircraft skins frequently requires a simi- 
lar blending process around the seams of the patched 
section. Grinding is also employed to remove the 
paint in the vicinity of the repair site. The common 
theme in these, and many other backshop operators, 
is the utilization of manual sanders and grinders. The 
health risks imposed by repetitive motions and dust 
inhalation combined with requirements for stricter 
process control and repair of more exotic compos- 
ite parts are driving the search for incorporation of 
robotic technologies. 

The customer does not consider the old approach 
of tight fixturing and preprogrammed motions an op- 
tion. Management does not want to replace workers, 
but rather make them more productive and provide 
a safer environment. What is mandated is a better 
tool to replace the current hand sanders and polish- 
ers. Telerobotics provides that tool. 

To augment the surface finishing task, a teler- 
obotic system must support the following function- 
abty. Instead of holding the hand tool, the operator 
grasps an input device (possibly a force reflecting joy- 
stick) that commands a robot permanently attached 
to the shop floor. Work pieces are stiU clamped onto 
dollies and rolled into the robot’s work area, but no 
additional fixturing is required. Through a quick 
change mechanism the system is capable of matching 
the tooling to the task. The operator drives the robot 
into contact with the surface and performs a series 
of motions to complete the task under two shared 
control modes. In mode one the system maintains a 
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contact force and a tooling angle of incidence to the 
workpiece normal. In mode two the system maintains 
a tooling angle of incidence to the workpiece normal 
while allowing the operator to modulate the applied 
contact force. Commands that would result in a con- 
tact force exceeding a predefined limit are automat- 
ically regulated at the limit. Both modes must be 
supported without any a priori knowledge of part ge- 
ometry. However, 1 li<' system must be flexible enough 
to efficiently incorporate automatic trajectory gener- 
ation software when it becomes commercially avail- 
able. Dual use applications of this technology range 
from polishing of bathroom fixtures to removing ma- 
chining marks on airframe skins and ship impulsers. 

3 Why a unified approach? 

For a judicious insertion to take place one must 
specify the proper level of technology and deliver 
a system specification that is cost effective. The 
true potential of telerobotics can not be realized if 
every application requires a costly custom solution. 
A unified infrastructure for telerobotics is driven by 
the overriding objective of reducing system life cycle 
costs. Insertion cost decrease as support ability and 
reliability increase along with ease of upgrading. 

3.1 Insertion Costs 

Under the custom solution approach, software devel- 
opment and system integration are at least 60% of a 
new insertion project and almost always the bottle- 
neck. A common framework allows basic commands 
for movement, gripping, trajectory generation, ob- 
stacle avoidance, and operator interface, etc to be 
developed at a higher level of abstraction. After pay- 
ing for the initial software development, the scope of 
the software development task is reduced to develop- 
ing the specific code that is required to implement a 
new process. Phase two of the UTAP will validate 
our estimate that initial development costs can be 
amortized within the first three applications. JPL 
estimated that a unified architecture could be recon- 
figured for a new application in one man week. 

3.2 Upgradability 

The government procurement process requires that 
we rigorously specify the functional requirements of 


any system we contract for. The standards and spec- 
ifications we mandate must be achievable by mul- 
tiple vendors to allow fuU and open competition. 
Without standards we can not remain competitive 
as technology advances. Standardizing at the inter- 
face level, provides the hooks and scars for future 
upgrades without limiting the contractor’s freedom 
to provide the most innovative and cost effective so- 
lution. For example, replacing a trajectory genera- 
tor module must not require an extensive software 
rewrite because the existing generator is imbedded 
into some piece of spaghetti code. Switching joy- 
sticks should be no more complicated then switching 
printers on a computer system. By mandating stan- 
dardization at the interface level we take the first step 
toward full interoperability. A unified architecture 
supports a system design methodology that evolves 
as the culture and technology evolve by providing a 
framework that builds in the future instead of locking 
it out. 

3.3 Supportability 

A common infrastructure breaks the one robot, one 
technician, one programmer, single operator loop we 
are currently trapped in. A unified architecture per- 
mits a common operator interface, reducing training 
requirements. The higher level of abstraction elimi- 
nates the need for programmers to be fluent in mul- 
tiple robot languages, again reducing training time 
and expense. Adding a new system into an existing 
facility no longer mandates the creation of a whole 
separate support hierarchy. Upper level support is 
easily centralized. By avoiding custom mechanism 
designs, hardware maintenance support costs are also 
dramatically reduced and are now available from a 
variety of sources. A single internal organization will 
provide technical support for a whole depot. The 
need for an expensive support contract, usually with 
the original manufacturer of the custom system, is 
eliminated. 

As the size of our workforce continues to decrease, 
increasing the productivity and range of skills of indi- 
vidual operators becomes more important. A single 
operator must become proficient in numerous pro- 
cesses and the robotic systems that are embedded in 
them. A common infrastructure wiU support a corn- 
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mon operator interface allowing a seamless transition 
across aU the telerobotic systems in the depot. Upon 
login the system will autoconfigure the look and feel 
to match known operator preferences. 

3.4 Reliability 

Software is the most unreliable portion of robotic sys- 
tems. A common architecture allows a majority of 
the software to be ported from one application to the 
next. Minimizing the creation of new code maximizes 
system reliability. Selection of proven hardware com- 
ponents mitigates mechanical breakdown. 

4 UTAP Overview 

The Robotics and Automation Center of Excel- 
lence (RACE) has embarked on a multi-year initia- 
tive to demonstrate the feasibility of telerobotic tech- 
nologies to accomplish a wide range of manufactur- 
ing applications and to develop a unified architecture 
that radically reduces the life cycle costs of teler- 
obotic systems. The Unified Telerobotic Architec- 
ture Project (UTAP) is tightly coupled to related 
efforts in the national labs and the domestic manu- 
facturing industry to maximize leveraging and dual 
use technology transfer opportunities. 

In Phase 0, completed in FY93, NASA’s Jet 
Propulsion Laboratory (JPL) performed an engineer- 
ing study to define a telerobotic architecture capable 
of performing a wide range of ALC remanufactur- 
ing applications. The study began by distilling a 
representative set of processes into a global set of 
functional requirements sufficient to span the needs 
of depot activities. The state of commercial and 
near-commercial technology was then surveyed to de- 
termine how these requirements may be met in an 
integrated system. A comparison of the functional 
requirements and the available technology products 
then produced an architecture of system components 
and their connectivity [1, 6] 

RACE has tasked the National Institute of Sci- 
ence and Technology (NIST) to act as coordinator 
and prime contractor for the FY94 study of issues 
pertaining to the specification and validation of the 
architecture. 


Phase 1, currently underway, is a joint effort by 
NIST and JPL to examine the feasibility of imple- 
menting the initial JPL architecture and to develop 
preliminary interface specifications between aU func- 
tional blocks of the UTA. This effort will include 
consideration of telerobotic technologies being devel- 
oped at national labs and emerging standards such as 
the Next Generation Controller Specification for an 
Open System Architectural Standard. A workshop 
wiU be held with industry and national lab represen- 
tation to solicit input for the validation and consol- 
idation of these preliminary interfaces into the UTA 
design. The output of this workshop will be a work- 
ing document that describes the interfaces and func- 
tional blocks of the UTA for Phase 2. 

In Phase 2, a systems integrator under contract 
to NIST shall be tasked to analyze the UTA inter- 
face specification and determine if an UTA compliant 
system can be implemented to solve the representa- 
tive application set, or suggest modifications to the 
portions where compliance is not possible. The con- 
tractor wiU then validate their analysis by designing 
an UTA compliant system and performing the vali- 
dation test set, which consists of: 

• Autonomous regulation of separation/stand-off 
while the human operator controls the other two 
cartesian coordinates via joystick, 

• Autonomous force regulation along a gently 
curved surface while the other two tangential 
cartesian coordinates are are controlled via joy- 
stick, 

• Registration of a workpiece by use of a vision 
system and fiducials, 

• Autonomous regulation of tooling angle of inci- 
dence to the workpiece normal, and, 

• Vision based tracking of circular trajectory on a 
planar surface. 

The contractor wiU demonstrate accomplishment of 
the tasks on physical hardware using an Adept mo- 
tion servo system and then demonstrate the inter- 
operability and modularity of the architecture by re- 
placing the Adept system with a Trellis motion servo 
system. Specific designs for three prototype systems 
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and an estimate of system integration costs and po- 
tential cost savings from a unified approach are also 
required. 

Future phases of the UTAP are not as crisply de- 
fined, but the objective is to continue UTA refine- 
ment to the goal of a releasing the architecture spec- 
ification in full system request for proposals in FY97. 
Phase 3, the prototype development phase, will see 
contracts awarded to systems integrators to imple- 
ment the Architecture/Interface specifications as de- 
pot prototypes. Each selected process will demon- 
strate a different facet of telerobotic technologies 
and will provide a core capability of the system. 
The three projected applications are: Telerobotic 
Telecrane Paint Stripping (T2PS), Telerobotic Sur- 
face Finishing (TSF), and the Telerobotic Cutting 
System (TCS). A parallel effort to create more so- 
phisticated prototypes which exercise even 

more of the potential of telerobotics is also antici- 
pated. Currently our prototype center PUMA is be- 
ing retrofitted with more commercial version of the 
Onika software environment developed at Carnegie 
Mellon University [2]. Fitted with a force and vi- 
sion system, the enhanced PUMA wiU be used to 
investigate the advantages of fuU interoperability in 
a telerobotics environment. Phase 4 encompasses a 
6 month operator prototype evaluation and analy- 
sis task. Throughout the prototyping and operator 
evaluation phases lessons learned will be feed back to 
produce a more robust architecture specification. 

5 Conclusion 

The problem is enhancing the quality of Air Lo- 
gistic Center repair and remanufacturing processes. 
The constraints are technical, economical, and cul- 
tural. Creative development and innovative applica- 
tion of telerobotics technology is the solution. The 
challenge is to redirect our system design philoso- 
phy to a methodology that embraces integration of 
commercially available components under a unified 
telerobotic architecture or framework. In coopera- 
tion with other national laboratories and agencies the 
AFMC Robotics and Automation Center of Excel- 
lence is championing the development and prototyp- 
ing of a unified architecture that will pave the way 


for judicious insertion of telerobotic systems into a 
wide range of dual-use applications. 
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Abstract 

There is an increasing demand for space robotic sys- 
tems which can reduce the number of potentially haz- 
ardous EVA’s on manned space missions. In addition, 
telerobotic maneuvers can easily become long and tire- 
some for the operator. This paper describes a robotic 
system which accepts motion and control commands 
which can be generated autonomously. 

The system developed has been designed to per- 
form an autonomous grapple based on guidance con- 
trol feedback provided by images from a single camera 
mounted on the slave robot’s end effector. The vision 
system consists of three parts. The first part is signa- 
ture based, trained on an arbitrary grapple interface 
(i.e. no special targets are required for guidance); it 
provides estimates for the 3D attitude of the interface 
by interpolating sampled signature correlations. These 
signatures are essentially the distribution of line ori- 
entations obtained by radial integration of the Fourier 
transform of a pre-processed edge image. The second 
part estimates the range and bearing of the interface 
based on the first and second moments of the prepro- 
cessed edge image of the interface. And the third stage 
of the algorithm verifies the results. 

The robot path follows a linear translation trajectory 
which is repeatedly adjusted for errors via the vision 
system. The end effector’s attitude is adjusted along 
the trajectory such that the grapple interface always 
remains in center view of the camera. 

Introduction 

Teleoperations are becoming increasingly important 
in hazardous environments (e.g. chemical plants, nu- 
clear power plants, space). Space systems applications, 
such as space-based assembly and maintenance, auto- 
matic rendezvous and docking, space exploration, and 
satellite monitoring and tracking ^ are of particular in- 
terest due to potentially long delay times between op- 
erator and robot. For instance, it has been estimated 
that robotic operations can take several times as long 
as extra-vehicular activity (EVA) to perform similar 
tasks Long delay times and limited bandwidth re- 
quire the robot to accept only high level commands 
and to possess locally a certain degree of autonomy. 


Object recognition and attitude determination of 
objects are essential components for successful sen- 
sor based teleoperational semi-autonomous robotic sys- 
tems. This paper will cover camera based systems, due 
to the relatively low cost of CCD camerzts and their 
wide use in remote robotic systems. 

Current vision based robotic systems utilize visual 
guidance targets. These targets must be placed on ob- 
jects with which the robot is to interact"^. However, 
when the objects are not readily accessible to humans, 
which is the case when operating in a hostile environ- 
ment such as space, the system restricts the class of 
robotic interactions to those which are specifically iden- 
tified and designed a priori. 

The new vision system developed eliminates the need 
for these guidance targets by allowing the object, or 
part of the object (i.e. a grapple fixture), to become the 
robot’s visual guidance target. This is accomplished 
by teaching the vision system the object by presenting 
different views. This training could be done with a 
physical object or by using a CAD model of the object. 

The complete description of a particular target rel- 
ative to the camera consists of six parameters: roll, 
pitch, yaw, range, and two bearing parameters. All six 
can be estimated, in principle, from a single camera 
image and knowledge of the target’s solid geometry. 

We have developed a new technique for determin- 
ing the three-dimensional roll, pitch, and yaw attitude 
target parameters and the three translation parameters 
assuming that the object is known and unoccluded. 

Method 

We restrict the class of images to those of machined 
objects, which characteristically produce sharp edge 
discontinuities. The edge discontinuities result from 
the projection onto the image plane of the polytopes, 
cylinders and conic sections comprising the object. Our 
approach relies on these projected edges as the basic 
features required to analyze and interpret the image 
data. 

Attitude Estimation 

The technique for estimating the attitude relies on 
extracting a signature of the object as viewed by the 
camera, and then matching it against signatures of the 
same object with known attitudes, generated off line 
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from a model of that object. The attitude estimate is 
obtained by interpolating among the signatures with 
the highest matching scores. The overall procedure is 
diagramed in Figures 1. 



Figure 1: Overall data flow diagram for the estimation 
of the attitude parameters. 

The algorithm computes as a signature the distri- 
bution of edge segments in the image as a function of 
orientation in the image plane. When an object un- 
dergoes an attitude transformation, the distribution of 
line orientations in the image plane changes; therefore, 
the signature contains implicit information about the 
object’s attitude. On the other hand, the signature is 
insensitive to the range and bearing of the object, as 
these do not affect the distribution of line orientations 
in the image. The signature matching procedure ap- 
proximates the inverse map from line orientations to 
object attitude. 

The signature extraction computation involves three 
steps. First, a binary line image is obtained from the 
original picture (Fig. 2), reducing the effect of changes 
in illumination of the target (Fig. 3). The prepro- 
cessing requests identification of the object within the 
field of view, and removal of clutter in the image. We 
achieve this by an image segmentation strategy dis- 
cussed in detail in the Appendix. The line image is 
then mapped into the two-dimensional Fourier domain, 
effectively collapsing range and bearing information, 
while preserving information on the object’s roll, pitch, 
and yaw (Fig. 4). Lastly, a weighted sum of the magni- 
tude in the Fourier image yields the distribution of line 
segments as a function of orientation, which serves as 
an attitude signature (see Gonzales and Wintz^ for an 
introductory discussion on the properties of the Fourier 
Transform applied to image processing)(Fig. 5). 

In particular, the Fourier transform provides an effi- 
cient and robust means of extracting the signature. In 
essence, any straight line in the image plane is mapped 


by the Fourier transformation into a straight line pass- 
ing through the origin of the transform domain, and 
orthogonal to the original line. The distance from the 
origin of the original line results in a complex phase 
modulation of its transform. By linearity of the Fourier 
mapping, an image consisting of several straight lines 
is transformed into a superposition of lines emanat- 
ing from the origin. Thus, a radial integration of the 
Fourier transform’s magnitude, about the origin of the 
transform domain, yields the desired signature. To 
compensate for the finite thickness and length of ac- 
tual line segments in the image, the Fourier transform 
is radially weighted, to deemphasize edge thickness. 

The attitude parameters are found by performing a 
cyclic cross-correlation of the target signature with the 
library signatures and selecting the maximally corre- 
lated match. The best signature picked reflects the ob- 
ject pitch and yaw. The offset of that signature match 
reflects the roll. Since signatures are 180® symmetri- 
cal there is a 180® ambiguity in the roll measurement. 
This ambiguity will be resolved in the match verifica- 
tion process described in Section 2.3. 

Position Estimation 

The technique for estimating the range and the two 
bearing parameters of the object relies on the center of 
gravity yc^ and the sum of the variances cr^ along 
the x-axis and the y-axis cr^ of the object’s edge 
image: 

<rl = <rl + al ( 1 ) 

The range of the object is determined using It can 
be shown, that (t^ is invariant to rotation and transla- 
tion of the image Using a perspective projection, 
and assuming that the size of the object is small com- 
pared to the range, the distance of the object in the 
actual image, zq, is given by, 

^ _ fo ^ ^ref ^ “^ref /o\ 

Zq - 7 (^) 

/ref 

where <7© is the square root of the variance of the actual 
image, /o is the focal length of the lens used, is 
the square root of the variance in the edge image of the 
matching signature, is the focal length of the lens 
used in generating the signature library, and is the 
range of the object used during training. 

By knowing the deviation of the center of gravity of 
the actual edge image against the center of gravity of 
the edge image of the matched library signature, the 
two bearing components are determined by: 

p — tan“^ ^ — tan“^ (3) 

/o /ref 

<t> = tan"^ ^ — tan"^ (4) 

/o /ref 

where (xi, j/i) and (xre/,2/rc/) are the center of gravi- 
ties of the actual edge image and the training edge im- 
age respectively, p and <j> are the values of the bearing 
parameters along the y-axis and x-axis respectively. 
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Figure 2: Synthetic image of the Micro Interface de- 
vice, a typical machined object. Figure 4: The weighted 2D FFT transform of the edge 

image of the Micro Interface Device. 



Figure 3: The edge image for the Micro Interface de- 


gridCE Signature 



Figure 5: The extracted signature, encoding the dis- 
tribution of line edge orientations in the original image 
of the Micro Interface device. 



Model Based Attitude Estimation and Verification 
The six attitude parameters found in Sections 2.1 
and 2.2 must be verified and the ambiguity of the 
roll must be resolved. This is accomplished with the 
help of a perspective projection (overlay) of a three- 
dimensional model of the target (Figure 6) in a cross 
correlation with the edge image of the object seen by 
the camera. The overlay with the highest correlation 
yields the best estimate of the attitude and position of 
the target. 



Figure 6: Data flow diagram for estimating the pose 
of an object based on the projection of a three- 
dimensional model of the target 

The six pose parameters of the n-best matches from 
the signature based algorithm in Section 2.1 were used 
to generate n corresponding overlays. A typical overlay 
is shown in Fig. 7. Those overlays were matched 

The three-dimensional model of the object was de- 
fined in terms of polygons where each polygon was de- 
rived by its vertices. To each polygon a surface normal 
was assigned to calculate the visibility of the polygon 
for the current attitude of the object. The visibility 
check was achieved by determining the sign of the dot 
product between the normal vector and a vector ex- 
tending from the the polygon to the view point. For 
positive values the polygon was visible and for negative 
values invisible. 

To increase the robustness and precision of the cross 
correlation we correlate the directions of the edges 
with the direction of the overlay edges. By looking 
at the directional image gradient we obtain not only 
the strength of the edge but also its direction. The 



Figure 7: Example of an overlay which was used in a 
cross correlation to estimate the pose of a target (The 
jagged edges are caused by the typ-setting process). 

modified cross correlation can be stated as 

AT, ^ 

match{i^ i) — EE|( Phm P^ov + + !/))[ 

x=0 y=0 

( 5 ) 

where (•} denotes the dot product between two vectors. 
The vectors piim and are defined as 

the two-dimensional vector jT from the 

camera image and overlay image respectively. It has to 
be noted that in Equation 5 we only have to perform 
the cross correlation in the vicinity of the projection 
of the object model because the signature based al- 
gorithm gives reliable estimates of the position of the 
target. 

The combination of the signature based method 
shown in Section 2.1 and the above approach based 
on cross correlation allows us to overcome one of the 
main disadvantages of the model based methods shown 
in the literature where a correspondence had to be 
established between image features and model features 
to solve for the attitude parameters. With the signa- 
ture based algorithm we are able to prune down the 
search tree of possible aspects of the model and reduce 
the range of the cross correlation considerably. 

Robot Control 

The algorithm for moving the robot towards the tar- 
get to perform a grapple is described below: 

♦ Using a camera mounted on the end effector, esti- 
mate the position and attitude of the target (i.e 
the handle to be grappled) with respect to the 
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camera. We will call this frame H where the term 
frame refers to both attitude and position. 

♦ We can define a frame, Cfh^ with respect to the 
target, which is the desired final frame of the cam- 
era for the approach. Now we can use our estimate 
for the target to come up with an estimate for Cf, 
called Cf, with respect to the Camera frame C. 
If C is within tolerable limits of Cf then we are 
at the final position and attitude and can grapple 
the object. 

♦ If we have not reached Cf then we can calculate 
our next desired camera position by using the fol- 
lowing constraints on the next camera frame, de- 
noted by iV((7). 

1 . The origin of N{C)^ d enoted by N{C)o = 
N{Cq), falls on the line CoCfo- 

2. N{C) should be at most a distance of dmax 
from C. 

3. The . 2 :-axis of N{C), denoted by N{Cz) 
should point towards H^. 

4 . N{Cx) should be perpendicular to both 
N{Cz) (of course) and Hy. In particular the 
sign of the vector is defined by N(Cx) = 
Hy X N{Cz). 

• We can calculate N{G) with respect to G from 
N{C), since the relationship of the camera frame 
C to the end effector frame G is known. This 
information can be put in the form of relative 
(x, y, z, iZ, P, Y) moves. 

• Command the robot to make the relative move 
calculated above. 

# Repeat the entire process. 


Results 

We have tested the algorithm on a set of synthetic 
images of an interface device used in space system ap- 
plications (Fig. 2). The Micro Interface device is used 
in SSF robotic operations. A ray-tracer was used to 
generate the synthetic images’ aspect transformations 
of the target with respect to the image plane. Although 
the results presented in sections 3.T3.4 were generated 
with synthetic images, similar results have been ob- 
tained for real camera images. 

A 5 X 5 signature library was generated from syn- 
thetic images to cover a square patch 10 ® on the side 
in the pitch-yaw plane with an inter-signature sepa- 
ration of 2.5® in each direction. The center orienta- 
tion was selected to correspond to a typical view of 
the Micro Interface during a grasping operation. This 
signature library was representative of more realistic 


libraries covering a larger range of pitch and yaw pa- 
rameter values. 

Using this signature library, four tests were per- 
formed: 

1. Random roll, pitch and yaw attitude estimation. 

2. Bearing and Range estimation. 

3. Range invariance test of roll, pitch and yaw. 

4. Bearing invariance test of roll, pitch and yaw. 

For consistency with the ray- tracer program, the tar- 
get’s attitude in all four tests was represented using 
three Euler angles, which measure attitude through a 
set of three rotations about the z, x, and again z axes, 
in the camera’s frame of reference (the image plane 
coincides with the xy-plane, and faces the negative z 
axis). We denote these three rotation angles by o, /?, 
and 7 , respectively. The translation components, range 
and bearing, of the image plane around the x-axis and 
y-axis were denoted with z, 0 , and p respectively. 

The tests provide evidence for the viability of the 
approach to 3D attitude and position determination. 
The procedure accurately estimates the position and 
the three attitude parameters of the object. The al- 
gorithm shows invariance to the range and bearing of 
the target for the estimation of the 3D attitude. These 
test results are described in the sections 3. 1-3.4. 

Roll, Pitch, and Yaw Estimation 

A random set of 10 target images with three arbi- 
trary Euler angles was used to test the algorithm’s abil- 
ity to correctly determine the target’s attitude. The 
exact and estimated Euler angles are shown in Table 1. 

The average error in any one parameter is 0.6®. The 
maximum error occurred for the a parameter of Im- 
age J, a difference of 2.7®. For this image, the wrong 
library signature was selected in the matching stage. 
The difference in the 7 parameter partially compen- 
sates for this error, reducing the combined a-l -7 angu- 
lar error for this image to only 1 . 2 ®. 

Bearing and Range Estimation 

A set of 4 target images was used to test the accuracy 
of the procedure for the bearing parameter and a set of 
6 target images was used to test the accuracy for the 
range. The exact and estimated parameters for range 
and bearing are shown in Table 2 and Table 3. The 
average error is 2 . 2 cm for the range estimate and 0 . 1 ® 
for the estimate of the bearing. 

Range Invariance 

A set of 14 target images was used to test the al- 
gorithms sensitivity to the target’s variation in range. 
The range of the target in the training images, used 
to generate the signature library, was 30cm from the 
image plane. The exact and estimated Euler angles are 


567 


Table 1: Attitude estimation test results. 


Image 

Exact Angles 
(degrees) 

a P 1 

Estimated Angles 
(degrees) 
ot P 1 

A 

16.09 

22.63 

-8,55 

16.57 

22.15 

-9.98 

B 

15.24 

20.46 

2.63 

15.60 

20.33 

2.81 

C 

18.39 

23.27 

7.69 

19,29 

22.44 

6.19 

D 

18.40 

22.08 

-4.55 

17.91 

21.89 

-3.65 

E 

19.67 

23.51 

-1.27 

20.57 

23.11 

-1.55 

F 

16.92 

24.55 

5.33 

16.90 

24.36 

4.78 

G 

17.60 

23.81 

-0.45 

17.86 

23.51 

-0.14 

H 

19.15 

21.31 

-5.24 

17.95 

21.38 

-3.51 

I 

15.17 

20.24 

-4.50 

15.32 

20,01 

-4.22 

J 

15.27 

23.68 

-2.81 

12.50 

24.74 

-0.70 


Table 2: Range estimation test results. 


Image 

Range 

(cm) 

Estimated Range 
(cm) 

A 

31 

31.17 

B 

35 

35.70 

c 

39 

40.00 

D 

45 

46.30 

E 

60 

62.10 

F 

90 

98.00 


shown in Table 4, for various target ranges. Angles are 
measured in degrees, range in centimeters. 

The errors incurred are moderate, and degrade as 
the range increases. The maximum error occurred for 
the a parameter of Image L, a difference of 5.0®. For 
this image, the wrong library signature was selected in 
the matching stage. The difference in the 7 parame- 
ter partially compensates for this error, reducing the 
combined a -h 7 angular error for this image to only 
1.5®. 


Table 4: Range invariance test results. 


Image 

Range 

(cm) 

Exact Euler Angles 
(degrees) 

a P J 

♦ 

30 

17.500 

22.500 

0.000 

Image 

Range 

(cm) 

Estimated Euler Angles 
(degrees) 

a P J 

A 

30 

17.553 

22.342 

0.000 

B 

31 

17.455 

22.199 

0.000 

C 

32 

17.514 

22.193 

0.000 

D 

33 

17.463 

22.116 

0.000 

E 

34 

17.467 

21.989 

0.000 

F 

35 

17.436 

21.827 

0.000 

G 

36 

17.468 

21.533 

0.000 

H 

37 

17.492 

21.529 

0.141 

I 

38 

17.412 

21.257 

0.141 

J 

39 

17.521 

21.351 

0.000 

K 

45 

18.068 

20.050 

0.140 

L 

60 

22.500 

17.500 

-3.518 

M 

90 

17.989 

20.885 

0.140 

N 

150 

17.776 

21.297 

0.140 


Table 3: Bearing estimation test results. 


Image 

Bearing 

(degrees) 

Estimated Bearing 
(degrees) 

A 

1 

0 

1.00 

0.02 

B 

2 

0 

2.00 

0.05 

C 

3 

0 

3.07 

0.13 

D 

4 

0 

4.14 

0.18 


Bearing Invariance 

A set of 5 target images was used to test the algo- 
rithm’s sensitivity to the target’s variation in bearing. 
The bearing of the target in the training images used 
to generate the signature library was 0® from the image 
plane’s normal. The exact and estimated Euler angles 
are shown in Table 5, for various target bearings, away 
from the image plane’s normal, in the direction of the 
positive y-axis. Both Euler angles and bearings are 
measured in degrees. 

The errors incurred are moderate, with a maximum 
error in the (3 parameter of Image E, a difference of only 
0.4®. Bearings of more than 4® would have brought the 
target partially outside the field of view of the camera, 
and were not tested. 
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Table 5: Bearing invariance test results. 


Image 

Bearing 

(degrees) 

Exact Euler Angles 
(degrees) 

a P y 

♦ 

0.0 

17.500 

22.500 

0.000 

Image 

Bearing 

(degrees) 

Estimated Euler Angles 
(degrees) 

a /? 7 

A 

0.0 

17.553 

22.342 

0.000 

B 

1.0 

17.435 

22.510 

0.000 

C 

2.0 

17.472 

22.549 

0.000 

D 

3.0 

17.385 

22.735 

0.000 

E 

4.0 

17.337 

22.939 

0.000 


Verification Method Results 


marks, an existing implementation of the algorithm can 
be quickly adapted to a different object, by supplying 
a signature library for the new target. Moreover it is 
not necessary to possess a physical model of the object 
because it is possible to generate the signature library 
with a ray-tracer program . In addition the signatures 
require only IK bytes of memory each. Thus for a typ- 
ical signature library of 225 signatures, the signature 
library is smaller than one 512 by 512 image. 

The algorithm relies on standard image processing 
routines (e.g. edge extraction, 2D Fourier Transforma- 
tion), which are available in numerous image processing 
libraries, and fast hardware implementations. 

The 2D Fourier Transformation, which is the most 
time consuming part of the procedure is readily paral- 
lelized, so that a real time version of the algorithm can 
be achieved by distributed hardware. 

Although this method was developed under the as- 
sumption that there is no clutter in the image and that 
the target is of a known type, these constraints can be 
lifted using additional initial scene analysis. For ex- 
ample, the scene can be segmented using standard im- 
age processing algorithms, and potential objects can 
be compared to known objects via signature matching 
and match verification. 


Pairs of krx}wr> arxf astimatad ofiantationa 



Figure 8: Matching random test points to best overlay 
candidate (candidates are on a 5 degree spaced grid) 

Figure 8 shows the overlay matching results. Each 
random test point is marked with and has a corre- 
sponding (correspondence is indicated by a connecting 
line) estimate denoted by ”o”. The estimates in this 
example fall on a 5^ x 5^ grid. As seen by the figure, 
all but one of the matches fell on the nearest grid point 
(i.e. the estimates were within 5 degrees). 

Conclusion 

A procedure has been developed to determine the 3D 
attitude and the position of machined objects without 
the use of any special marks. Since there is no need for 
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Appendix 

In order to remove background clutter we use infor- 
mation about the constraints of our scene with respect 
to our target. This appendix discusses in detail the im- 
age preprocessing techniques which we used in connec- 
tion with the robotic grappling application discussed 
in this paper. 

Preprocessing the Raw Image 

We start our processing given a single frame 256 gray 
scale image, I. The image / is filtered three times 
producing three more useful images. The first is a low 
pass filtered version of the raw image, denoted by 7. 
The next two filtered images are the x and y gradients 
of the raw image, denoted as Vx7 and Vy/ respectively. 
Two binary edge images are then constructed using the 
above filtered images. 

The first edge image is a thin edge image, E, found 

by, 

f[\/(v^7FT(v^ 

U U + i/2) 
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where is treated as pixel- wise binary output op- 
erator. The above equation attempts to enhance lo- 
cal edge information by dividing the magnitude of the 
gradient of the raw image by the average neighbor- 
hood pixel intensity. Thus small intensity variations in 
dark regions could be equivalent to larger variations in 
lighter regions^ 

The second edge image is a thick edge image, , 
defined as, 


E-^ = 


I 


\/(Vx/)^ + (Vy/)^ 

U+1/2) 



( 7 ) 


The image E'^ is nearly identical to E except that 
the threshold used is lower. Thus, E'^ contains more 
white pixels (pixels which satisfy the binary condition). 
Therefore, E C E'^ (i.e. every white pixel of E is 
a white pixel in E^). Note that while E provides a 
cleaner edge image, E'^ preserves the connectivity of 
the edge image. This connectivity will be used below 
to determine a processing region which rejects back- 
ground clutter. 


Rejecting Background Clutter 


In the discussed application we are interested in find- 
ing a handle which is mounted on a predominantly 
lighter background. In addition we assume that the 
handle structure will be larger than any unwanted clut- 
ter on the same background. Thus, we look for the 
largest edge structure in a dark region which is con- 
tained in a lighter region. This region tells us which 
information in E should be processed and which infor- 
mation should be rejected. Next we must determine 
what is light and what is dark as well as what is con- 
sidered an edge structure. 

Using the original raw image, 7, we generate a his- 
togram. This gray level histogram is then clustered into 
three fuzzy classes, dark pixels^ m edium pixels^ and light 
pixels by using a fuzzy c-means clustering algorithm^^. 

The light regions of the raw image are found us- 
ing the mid-point between the dark and medium pixels 
cluster centroids as a image threshold. This thresh- 
olded image is then segmented into blobs based on 
the pixels 8- connectivity^ . The connectivity analy- 
ses only reports significant blobs (blobs which contain 
a significant number of pixels). Out of all the signif- 
icant blobs found, the algorithm picks the one with 
the largest area (number of pixels) as the largest light 
region. 

Next, the raw image is thresholded by the mid-point 
between the light and medium pixel cluster centroids. 
This time, all pixels below the threshold are considered 
logical 1 and all above are logical 0. This new binary 
image is combined with E~^ using a logical pixel-wise 
and. The resulting binary image contains edge struc- 
ture in the dark regions of the raw image. 


The edge structure is applied to the connectivity 
analyses algorithm to find all significant connected edge 
structures in dark regions of the raw image. The re- 
sulting processing region is then determined to be the 
largest edge structure in a dark region which is within 
the largest light region. If no processing region is found, 
then the largest edge structure s a darA: region becomes 
the processing region. And if there where no signifi- 
cant edge structures in a dark region found a warning 
message is issued and the entire image is used as a 
processing region. 
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Abstract 

This paper discusses an application of the Genetic 
Algorithm^ a parallel and global search technique that 
emulates natural genetic operations. Real application 
problems often require optimization of a large number 
of parameters with high precision. Since the existing 
Genetic Algorithms do not represent the parameter sen- 
sitivities, we have devised a novel scheme of Hierarchi- 
cal Genetic Algorithm to solve complicated engineering 
problems. Using this approach, the higher level GAs 
propose promising search spaces, while the lower level 
GAs search in more detail with additional parameter 
sets. This decreases the complexity of search and uti- 
lizes the computing resources efficiently. This scheme 
has been used to design an autonomous control systems 
for space- based resource processing plants. 


1. Introduction 

Applications of computer technology are expanding 
from pure data processing to information and knowl- 
edge processing which enables Computer-Aided Sys- 
tem Design. Knowledge-based system applications are 
characterized by symbolic processing, nondeterministic 
computation, dynamic execution, high potential for par- 
allel and distributed processing and knowledge manage- 
ment. However, fundamental physical limits of current 
technology have not been overcome for the more sophis- 
ticated computation-intensive problems, such as predic- 
tive modeling and forecasting, design automation, large 
scale, simulation and artificial intelligence. The com- 
bination of technology and economic factors make par- 
allel and distributed computing systems attractive and 
effective for a large variety of intelligent machine appli- 
cations 1^1. 

The emergence of massively parallel computers has 
also fueled a growing interest in problem solving sys- 
tems based on principles of evolution and heredity. One 


class of such evolution strategies is Genetic Algorithms 
The remarkable success demonstrated by Genetic 
Algorithms (GA) in search, optimization and learning 
has substantially increased interest in their potential 
application to modeling, simulation and design of com- 
plex real world systems Such applications include 

identification and calibration in model construction and 
subsequent model-based control synthesis and policy 
optimization. However, complex simulations typically 
require large execution times to evaluate alternatives, 
or in GA terms, to obtain fitness values for newly gen- 
erated chromosomal individuals. Such lengthy simu- 
lations present a major bottleneck to GA application 
since tens, or even hundreds, of individuals may need 
to be evaluated in every generation. Parallel processing 
offers promise of reducing this bottleneck along two mu- 
tually supporting avenues: 1) speeding up the simula- 
tion needed to estimate fitnesses using distributed sim- 
ulation methods and 2) parallelizing the evaluation 
and processing of fitness information. Both avenues are 
under active investigation and indeed a computer archi- 
tecture to support their integration has been suggested 
[19] 

Real application problems often require optimization 
of a large number of parameters with high precision. 
These parameters increase the complexity of the search 
problem. In existing approaches, a chromosome rep- 
resenting the parameters does not contain information 
about their sensitivity, even though parameters influ- 
ence the system performance to different degrees. 

We have developed a novel scheme of a Hierarchi- 
cal GA optimizer which executes multiple GA modules 
to solve complicated problems. These GA modules are 
constructed hierarchically and creation/deletion is per- 
formed dynamically based on the performance of each 
module. 

Each GA module deals with a different degree of ab- 
stracted models for evaluation and a different number 
of parameters for optimization. High level GA modules 
usually search for fewer parameters which are more sen- 
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sitive to the system performance. They are looking for 
milestones of promising search region instead of accu- 
rate solutions. The candidate individual selected from 
the high level GA module represents a sub search space 
and they are sent to the lower level GA modules for 
more detail search. The lower level GA takes advan- 
tage of the received information, and employs a greater 
number of parameters for further optimization. 

The solutions found at the lower level GA module 
are reported back to the higher level GA module, if it is 
better than the candidate individual from parent mod- 
ule. This information is used to update the fitness of 
the parent. As the purpose of high level GA module 
is not to find actual solution, the models of this level 
are not necessarily accurate. In order to speed up GA 
search, the high level GA modules access less accurate 
models which can reduce simulation- based fitness eval- 
uation time. The basic concept is that of successive 
approximation provided by a nested sequence of mod- 
els 

2. Hierarchical Genetic Algorithms 
for Complex Problems 

A simulation model of such a complex architecture 
is most naturally formulated as a variable structure 
model A Hierarchical GA is implemented on a self- 
organizing variable structure, where creation/deletion 
of modules are determined by their performance. 

2,1 Brief Review of Asynchronous Genetic 
Algorithm 

The GA (genetic algorithm) is a probabilistic algo- 
rithm which maintains a population of individuals, P{t) 
= xi(f), . . . , arn(f) for iteration t. Each individual repre- 
sents a potential solution to the problem at hand, and, 
in any evolution program, is implemented as some (pos- 
sibly complex) data structure 5. Each solution x^(f) is 
evaluated to give some measure of its fitness. Then 
new population (iteration f -f 1) is formed by selecting 
the more fit individuals (select step). Some members 
of new population undergo transformation (recombine 
step) by means of ^‘genetic” operators to form new so- 
lutions. There are unary transformation m, (mutation 
type), which create new individuals by a small change 
in a single individual (m : 5 - 5) and higher order trans- 
formations Cj (crossover type), which create new indi- 
viduals by combining parts from several (two or more) 
individuals The control parameters for genetic op- 
erators (probability of crossover and mutation) need to 
be carefully selected to achieve acceptable performance 
After some number of generations the program 
converges and is successful if the best individual repre- 
sents the optimum solution. 

We have developed concepts for parallel genetic al- 
gorithms that are especially oriented to simulation- 



Figure 1: The Architecture for Asynchronous Genetic 
Algorithm Simulation 

evaluated individuals on high performance computers. 
We have investigated a class of Asynchronous Genetic 
Algorithms (AG As) which does not need to be syn- 
chronized by generations to create successive popula- 
tions. In a multiprocessor architecture, individuals are 
evaluated concurrently and a central agent updates the 
genetic population continuously as the evaluation re- 
sults become known. The motivation behind such asyn- 
chronous updating is the recognition that not only may 
simulation runs be time consuming, but their comple- 
tion times may be highly variable. This variability is 
quite common in performance measuring simulations ^ . 
When such variability is significant, the barrier synchro- 
nization imposed by conventional GAs can greatly im- 
pede search progress since it requires that processing 
cannot proceed to the next generation until the slowest 
individual in the current population has completed its 
evaluation In contrast, the AG A allows new indi- 
viduals to be tested ais soon as both the information and 
the computer resources are available to do so. 

A concern immediately raised in the AGA paradigm 
is that the blending of generations occasioned by such 
asynchronized processing might adversely affect the re- 
combination schemes underlying GA search. Certainly, 
the supporting theory typically limits selection, mating, 
crossover and other operations to members of the same 
generation t®’ Fortunately, some results in the litera- 
ture suggest that search time and search success are not 
degraded, at least in application to typical test function 
suites We note that such artificial fitness functions 
do not include time dependence that might appropri- 
ately suggest the necessity for generational integrity. 

As shown in figure 1, the processing elements (PE) 
in the asynchronous genetic algorithm can be catego- 
rized as : genetic population PEj evaluation PEs and 
control PE. While the PGA executes serial GAs in the 

^It certainly occurs when runs are executed in conditional 
mode where termination depends on pre-established criteria (e.g., 
a run may be terminated as soon as failure to achieve a prescribed 
goal is obvious). However, run time variability may £dso airise 
when runs span a fixed observation interval (on the model time 
base). This may be due to the variability in workload encoun- 
tered by the simulation engine or in the resources allocated to 
the particular trial 
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multiple processing elements with subpopulations, the 
asynchronous genetic algorithm evaluates a single in- 
dividual in a processing elements {evaluation PE) at 
a time. The total population is always contained in 
the genetic population PE which executes genetic op- 
erations and updates the population. It also generates 
new individuals whenever it receives request from the 
control PE, 

Message transfer between the genetic population PE 
and the evaluation PE is controlled by the control PE : 
it delivers evaluated individuals to the genetic popula- 
tion PE and requests new individuals for the evaluation 
PE. Thus the evaluation PEs keep evaluating individu- 
als with which the genetic population PE continuously 
updates the population. 

2.2 Resolution Increasing Scheme in the 
Hierarchical Genetic Algorithm 

A binary chromosome, a unique knowledge represen- 
tation scheme of Genetic Algorithms, provides a way 
of controlling the accuracy of parameters. The size of 
the binary code determines the number of points to be 
investigated. As more bits are employed, the search 
points increase dramatically (exponentially). Therefore, 
longer string size may provide accurate parameter val- 
ues, but it also makes the GA to search through a large 
number of points. 

As shown in figure 2, same size of binary code can 
increase accuracy as search space changes. At level 1, 
the original search space is defined by MIN and MAX. 
We employs 3 bits and there are 8 possible search points. 
If a certain point (binary code) is selected, the distance 
between its neighbors becomes a new sub search space. 
Therefore the selected candidate individual at the high 
level GA module has meaning as representation of its 
neighbors (sub search space) rather than an actual value 
itself. The same size of binary code is employed with 
the new search space, which increases the accuracy of 
the parameter value. 

2.3 Expanding Search Parameters in the 
Hierarchical Genetic Algorithm 

The previous section explains how search accuracy is 
controlled by the Hierarchical GA. If the search prob- 
lems involve a large number of parameters, the G A takes 
longer or directs to local minima. The Hierarchical GA 
employs an expanding search parameter scheme. The 
higher level starts to search for a small number of pa- 
rameters. The result obtained at the higher level is sent 
to the lower level GA, where extra parameters are in- 
cluded to the received parameters. The lower level GA 
takes advantage of the received information so that it 
need not search all the parameters from the beginning. 

The expanding parameter scheme in the Hierarchical 
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Figure 2: Resolution Control Scheme in Hierarchical 
Genetic Algorithms 

GA also helps determine the optimal number of param- 
eter sets. Complicated designing problems involve the 
evaluation of a large number of parameters, where the 
type of parameters as well as their appropriate values 
are often unknown. 

2.4 Execution of Multiple GA Modules 
in Parallel 

The selected individuals during the search operation 
from the root AGA create lower level GA modules as 
shown in figure 3. The time taken by the module for 
checking its population and selecting a candidate in- 
dividual may be either deterministic or nondeterminis- 
tic. In this experiment, we choose a variable selection 
interval scheme, in which the time intervals of subse- 
quent checking become larger as the GA search con- 
tinues. This strategy is based on a characteristic of GA 
search, the fitness of population increases faster in early 
search stages and saturates to a certain level. Smaller 
checking intervals enables the GA to choose new candi- 
dates before stagnation. 

When a lower level AGA module selects an individual 
as a candidate for the next level, it also reports fitness 
to the parent AGA module. This feedback information 
updates the fitness of parent individual. But the in- 
dividual structure of the parent and child may not be 
comparable to each other because they represent differ- 
ent parameter sets or different search space. The popu- 
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Figure 3: Creation and Execution of multiple AG A in 
Self-Organizing Hierarchical GA 


lation of child AG A searches through the space which is 
suggested by a candidate individual from parent AGA. 
The fitness of the lower level individual, if better, up- 
dates the fttness of parent individual. If updated, the 
fitness of parent no longer represents an actual fitness 
of the binary code. But, since the fitness is increased by 
a certain amount, its value can not be represented in its 
environment. The individual now has a higher fitness 
and has a greater probability of being selected for the 
next evaluation. 

Figure 4 shows the module components at each 
level. Each module contains a controller which ex- 
ecutes AGAs to solve a given problem and select 
candidate individual(s) reported from AGAs. It cre- 
ates/deletes lower level modules and communicates with 
higher/lower level modules. This controller is the in- 
telligent component of module and is implemented by 
a rule-baised expert system. The module is defined 
as a class in an object-oriented programming environ- 
ment (Chez-Scheme 1^1) and the same module struc- 
ture is created by higher level object. This module is 
program-interfaced to an AGA procedure written in C. 
The decision making component is implemented in a 
symbolic processing language, while the numeric com- 
putation procedure is written in C. 

When the module receives a problem, it may expand 
search parameters as well as increase resolution. With 
any given problem, the controller first expands parame- 
ters and sends them to the PARA- AG A for GA search. 
The selected individuals at the PARA- AGA are sent to 
HIGH- AG A to increase resolution by the scheme ex- 
plained above. The fitness of the selected individual 
at the PARA-AGA are also reported to the high level 
HIGH-AGA to update the fitness of parent individual. 
The candidate individuals for the next level are selected 
at the HIGH-AGA where the selected individual is also 



Figure 4: Module Components in the Self-Organizing 
Hierarchical GA 

reported to the PARA-AGA of same module. 

3. Design of Control System using 
Self- Organizing Hierarchical GA Environment 

A working prototype of a plant for producing oxy- 
gen from Martian atmosphere, is constructed at NASA- 
UA Space Engineering Center The purpose is to 
evaluate the best designs and operation parameters for 
the Mars mission. Martian C02-rich atmosphere is fil- 
tered and compressed to a temperature and pressure 
suitable for electrocatalysis in a Zirconia- based oxygen 
cell. Design issues include the size of the inlet pipe, 
power requirements of the compressor and design of the 
oxygen cell including: cell configuration, material prop- 
erties, electrical parameters such as operating voltage 
and current density, electrode materials, and method of 
application 

In this experiment, we try design an optimal FLC to 
control the temperature of the oxygen production sys- 
tem. The basic idea of the fuzzy control centers around 
the labeling process, in which the reading of a sensor is 
translated into a label as done by human expert con- 
trollers With expert supplied membership func- 

tions for this labels, a reading of a sensor can be fuzzified 
and defuzzified. It is important to note that the transi- 
tion between labels are not abrupt and a given reading 
might belong to several label region. 

The fuzzification and defuzzification processing does 
not need to be sequential. The input signal can be 
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(a) Fuz^ Inference Network 



Figure 5: Fuzzy inference network and fuzzy sub- 
spaces: (PLiPositive Large) (PS:Positive Small) 

(ZE: Zero) (NS: Negative Small) (NL:Negative Large) 

fuzzified/defuzzified simulatenously by matching mem- 
bership functions. Therefore fuzzy control processing 
can be adapted to a parallel neural network structure 
where each neuron represents functions (fuzzy member- 
ship) and links represent the weight of a fuzzy rule. 

Figure 5(a) shows the structure of the Fuzzy Neural 
Net Control System (FNC) and its fuzzy subspace (Fig- 
ure 5(b)) In this experiment, 5 input membership 
functions are assigned to each input signal and 5 out- 
put membership functions are used to compute fuzzy 
output signal. 

While an earlier Fuzzy Logic Controller was 

implemented in rule-based form {if-ihtn), the FNC em- 
ploys a parallel inferencing network structure. Due 
to the parallel fuzzification/defuzzification scheme, the 
FNC can improve real-time performance of the control 
system for practical application. 

The performance of the FNC is determined by the 



Figure 6: GA optimization of the FNC module 
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Figure 7: Oxygen Production System Cluster 

input membership functions of layer 1 which fuzzify the 
input signals and the output membership functions of 
layer 4 which defuzzify normalized firing strengths. A 
membership function is specified by number of param- 
eters. 

In order to find a high performance fuzzy member- 
ship functions without the help of human expertise, 
it is necessary to employ computer-aided optimization. 
Since tuning the membership functions requires ad- 
justing many parameters simultaneously, hill-climbing 
search methods would suffer from the complexity of the 
search space. 

For this reason, a probabilistic optimization method 
utilizing evolution strategies, such as Genetic Algorithm 
(GA), was employed to find optimal membership func- 
tions. Since optimizing multi-parameter problems takes 
a long time, we developed new form of GA which is es- 
pecially oriented to parallel computers that can satisfy 
the real-time constraints of the system. 

Figure 6 shows the interaction of the FNC, simula- 
tion model and GA-optimizer. The FNC operates the 
simulation model, such as heater /cluster model of the 
Mars Oxygen Production System (OPS). 

The OPS, shown as in figure 7, includes Zirconia 
tubes located symmetrically inside a cylinder. A radia- 
tion heater is wrapped around the outer surface. With 
this configuration, the majority of heat transfer between 
the outer surface and the oxygen gas inside the system 
is due to radiation. Applying the one-dimensional heat 
equation with lumped temperature distributions for the 
surface and oxygen temperatures we obtained two first 
order differential equations as provided below. The Tp 
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3 Membership Functions 



Figure 8: Individual evaluation procedurerFNC opera- 
tion with heater/cluster thermal model 


represents the pipe temperature and Tz is Zirconia tube 
temperature. A variable SW is either 0 or 1 to control 
the heat source (heater). The objective of the FNC is 
to increase the temperature of the Zirconia tubes at a 
constant rate until a goal temperature is reached 

^ = 2.75 X 5PV - 4.42 x 10" _ T^) - 8.65 x 
lO-\Tp - 278.0) 

^ = 4.42 X 10-i2(Tp4 - T^) 

As shown in figure 5, there are two input signals to 
the FNC e.g., temperature increase error rate (input 1) 
and rate of its error rate (input2). Based on two inputs, 
the FNC produces an output command which controls 
on/off duty cycle of the heater element in the model. As 
shown in figure 5, we employed 5 membership functions 
for each input signal and 5 membership functions for 
the output signal. 

Figure 8 provides detailed procedures of the FNC in- 
tegrated with the GA-optimizer. An individual of a GA 
represents one trial set of fuzzy membership functions. 
The G A optimizer sends a parameter assignment to the 
FNC which determines its fuzzy membership functions. 
The model is reset to its initial conditions (starting tem- 
peratures). The operational specifications such as de- 
sired temperature increase rate and goal temperature 
are set inside the controller. The performance of a 
trial individual fitness is measured as the sum of the 
MSE (Mean Square Error) between actual temperature 
increase rate and desired one and maximum absolute 
value of error of temperature increase rate. 

3.1 Design of the FNC for the Oxygen 
Production System 

Our primary objective is to design optimal fuzzy 
membership functions that perform well with given op- 
erational specifications while utilizing minimal human 
expertise. The controller increases the temperature of 
the cell at a constant rate. 




5 Membership Functions 



Designing an optimal FLC involves the investigation 
of several alternatives, such as type of membership func- 
tions and the number required. A single-level GA starts 
to optimize the FLC based on the assumption that 
a given FLC specification, such as type or number of 
membership functions, is optimal. But real world ap- 
plication problem is often too complicated to determine 
the correct system specification. 

Hierarchical GA solves this problem by changing its 
structure according to the performance of each module 
which employs a different number of fuzzy membership 
functions and parameter resolution. Starting from a 
small number of parameters, it expands search parame- 
ters and their resolution as they create lower levels. The 
lower levels take advantage of information found at the 
upper level AGA module. 

Figure 9 shows how search parameters are expanded 
as Hierarchical GA creates lower level modules in the 
example of designing a FLC. The upper level module 
starts to design the FLC with a small number of mem- 
bership functions. Designing a FLC with fewer mem- 
bership functions is relatively easy compared to a large 
number of membership functions. Even though the up- 
per level need not find the best membership function, it 
does provide some information to the lower level which 
supports the design of an optimal FLC. Since the mem- 
bership functions found at the upper level are optimized 
based on constraints of a small number of parameters, 
the lower level GA modules give small tolerances to the 
received parameters. This is due to the effect of new pa- 
rameters on the old parameters optimized earlier. Fig- 
ure 9 illustrates how to expand membership functions, 
the shade area of membership function represents its 
tolerance. 

As we increase the number of employed fuzzy mem- 
bership functions, the fuzzy rule table must also be ex- 
panded. Figure 10 shows ways of adding more slots to 


576 




5 3 mput-B So-Far-Bost Fitnoss Improvomeni 



0 1000 2000 3000 4000 5000 


Figure 12: Simulation Results from Self-Organizing Hi- 
erarchical GA to design optimal FLC 

improvement of Hierarchical GA shows that when a 
certain GA module is executed, the fitness increases 
suddenly. The FLC specifications of the module pro- 
vides the best performance among other FLC specifica- 
tions. The population of each module represents differ- 
ent species, because they expresses different number of 
parameters and search spaces. When a certain species 
(the correct one) is created, the performance of the Hi- 
erarchical GA improves in a step like manner. The tem- 
perature profile shown in the figure 12 is that of by the 
suggested optimal FLC in the Hierarchical GA. 

4. Conclusions 

Real world application problems often require opti- 
mization of a large number of parameters with high 
precision. These parameters increase the complexity of 
the search problem. In existing GA, a chromosome rep- 
resenting the parameters does not contain information 
about their sensitivity, even though they influence the 
system performance to different degrees. 

We have devised a novel scheme of Hierarchical Ge- 
netic Algorithms in self-organizing variable structure 


Figure 11: Tree of Various FLC Specifications 

the fuzzy rule tables. Since the fuzzy rules are optimized 
based on constraints of fewer membership functions (for 
example, 3 input A,B, and 3 output membership func- 
tions), the expanded fuzzy rules need to be optimized 
with not only more slots but also some degree of toler- 
ance of suggested rule parameters. 

Figure 11 shows a tree of various specifications of the 
FLC in which the Hierarchical GA searches through op- 
timal design. Hierarchical GA first optimizes fuzzy rules 
which are more sensitive to the FLC performance at the 
root module. The small number of fuzzy membership 
functions with small parameter variance were used in 
order to maximize the sensitivity of fuzzy rules. The 
fuzzy rules found at the root module are sent to lower 
levels, where two different membership function types, 
such as triangular and bell shape are employed. The 
lower levels have wider search ranges in the parameters 
and utilize the fuzzy rule information received from the 
root. A greater number of fuzzy membership functions 
are employed when the lower level GA modules are cre- 
ated. 

Figure 12 shows the simulation results that illustrate 
how the Hierarchical GA investigates various FLC spec- 
ifications to design an optimal controller. The fitness 
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environment for complex real world problems. The de- 
sign of a temperature control system for the oxygen pro- 
duction plant was selected as an experiment. Since con- 
ventional control schemes are limited their functional- 
ity to relatively simple applications, Fuzzy Logic/Neural 
Net control methods are received more attention for the 
sophisticated applications. The parameters embedded 
in the controller need to be optimized for the required 
control performance. 

In this paper, a Hierarchical GA investigates various 
FLC specifications using variable structure simulation. 
More sensitive parameters, such as fuzzy rules, are opti- 
mized before other parameters. Higher level G As search 
for candidate individuals that might contain the opti- 
mum in a given search space. These candidates are sent 
to the lower level to be investigated in greater detail. 
If better solutions are found at a lower level, they are 
reported back to the higher level and incorporated into 
its on-going GA search. The higher level GAs search 
in a sparse space with fewer parameters that influence 
the system performance significantly. In order to reduce 
GA search time, higher levels also utilize less accurate 
models for which simulation-based evaluation time is 
reduced. 

The simulation exhibited interesting search behavior 
: when a good GA module is discovered, the perfor- 
mance increases suddenly. This suggests that not only 
has a good design been found, but that all other design 
frameworks can be eliminated. 
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Abstract 

The efficiency of an autonomous robot navigating in 
indoor environments depends crucially on the ability of 
the robot to exploit spatial relationships extracted from 
perceptions of its environment. Smith, Self, and 
Cheeseman^^ describes a formalism based on Kalman fil- 
ter theory, where perceptions from different locations can 
be combined to improve the accuracy of the robot pose 
estimate. We argue that while accuracy is an important 
property of perceptions of the robot state, a more impor- 
tant property of perceptions of the environmental state are 
their temporal and spatial range of applicability, which 
will be referred to as perceptual relevance. This paper 
introduces a relevance measure based on Jaynes maximum 
entropy principle, measuring the relevance of a spatial 
description of the robot environment. The conjunction of 
accuracy and relevance is denoted information quality. A 
formalism based on the information quality concept is 
developed for the class of one-agent applications, for 
which the formalization of the dependency between per- 
ceptions and actions of a robot is straightforward. 

1 Introduction 

A robot can be viewed as a controller, the purpose of 
which is to transform the current system state into a goal 
state. After having executed the action sequence, the sys- 
tem state should be closer to a goal state. If we by “world” 
denote the conjunction of robot and system, this paper is 
based upon that essentially three issues determine the per- 
formance of the state transformation process: 1) The accu- 
racy and relevance of the robot’s perception of the world 
state, 2) The robot’s capability to find an action sequence 
that forces the current model state into a desired goal state, 
and 3) The precision of the transformation from abstract to 
physical actions. 

The behavior of the robot corresponds to what actions the 
robot selects to execute in a particular situation. For sen- 
sorless robots, behavioral information in the form of 
action sequences are given a priori, and may not change 
due to external events during operation. Although this is a 
straightforward way to implement robot behavior, the 


robot requires a well-defined working environment where 
the properties of each object must be accurately specified. 
In effect, all information is given to the robot a priori, and 
a major problem is to maintain a configuration of the 
working environment that is consistent with the specifica- 
tion. 

More flexibility is achieved if the robot is capable of 
acquiring information about the true configuration of the 
working environment during operation. Robots capable of 
acquiring information during operation may be classified 
as being either reactive^ or deliberate. While reactive 
behavior commonly is hard-wired into the robot, deliber- 
ate behavior is exhibited by robots maintaining an explicit 
world model. 
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Fig. 1. Structure of a deliberate robot 

Obviously, reactive robots are inherently autonomous, 
while deliberately behaving robots may be anything from 
autonomous to tele-operated. In the autonomous case the 
explicit model is implemented in the robot software, while 
in the tele-operated case the model exists in the mind of 
the expert controlling the robot. Issues affecting the per- 
formance of a deliberatively behaving robot will be 
addressed in this paper. Throughout the paper, except 
where explicitly stated, the only assumption being made 
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concerning the deliberately behaving robot is that it pos- 
sesses an explicit world model. 

In Fig.l, a useful way to describe the structure of a delib- 
erate robot is presented. By splitting the graph horizontally 
at increasing heights, one gets the following sequence of 
dichotomizations: environment/agent, hardware/software, 
system/cognition. The figure also illustrates the cyclic pro- 
cessing of perceive-reason-act which starts when a task 
has been given. 

A distinguishing feature among deliberate robots is the 
degree of human interaction, spanning from autonomy to 
tele-operation. For an autonomous robot the “reasoning” 
module (see Fig.l) corresponds to a computer program 
while for a tele-operated robot the it corresponds to the 
human operator. Albeit having this difference, all deliber- 
ate robots need high quality information in order to do 
proper inference. By keeping the information quality 
above some pre-defined level, the likelihood of erroneous 
inference is kept sufficiently low. In this paper we will 
develop means for preserving the information quality, 
which are applicable to a large class of deliberate robots. 

The formalism developed in this paper is based on an 
assumption of a one-agent application. The formalism 
might however be extended to cover many-agent applica- 
tions as well. One distinguishing feature between one- and 
many-agent applications is that while failure to execute a 
plan in a one-agent application is caused either by poor 
information or by poor control, for many-agent applica- 
tions an additional cause of plan execution failure are 
actions executed by other agents. Without doing any fur- 
ther elaboration on the class of many-agent applications, it 
suffices to notice that more powerful models must be 
developed and that real-time constraints become cruciai^^. 

The robot uses the world model for interpreting the 
present situation. Therefore, it is of great importance that 
the model discrepancy is small. On the other hand, a too 
detailed model suffers from high time and space complex- 
ity. The difficulty to satisfy these somewhat contradictory 
constraints is one reason why many deliberate robots are 
either too slow or too error-prone. At the core of the prob- 
lem are the issues of uncertainty and complexity. Typi- 
cally, reducing the complexity of a model increases the 
uncertainty and vice versa. However, by using application- 
specific heuristics, it is possible to suppress world proper- 
ties of minor importance, thereby simplifying the model. 
In this way a model with both low discrepancy and moder- 
ate complexity can be establish. For example, in the 
mobile robot case, a 2D (global) map suffice for robot nav- 
igation, while a 3D (local) map is needed for many object 
manipulation tasks. By using this heuristics, a 2D/3D 


composite model is created, with a fair trade-off between 
complexity and discrepancy. The problems to represent 
and reason under uncertainty have been addressed in sev- 
eral papers^^’ .Although this is an important 

research area, the work do normally not consider the prob- 
lem of uncertainty and relevance maintenance, in particu- 
lar not when the uncertainty and relevance varies in time 
and space. 

Having developed an appropriate model, the next question 
is why and when the model should be updated with fresh 
information? In order to answer the first question, we reca- 
pitulate that the model should have limited space and time 
complexity. This inevitably leads to an information loss. 
Accordingly, situations may occur where ignorance of 
some world property will result in robot malfunction, 
although such situations may be very rare. Crucial for the 
prevention of robot malfunction is to maintain a low 
model discrepancy, since the next action to execute is 
determined by the model interpretation. Obviously, the 
penalty for having a model of low complexity is that it 
must be updated frequently to keep the discrepancy low. 

When to update the model depends on how low one want 
to keep the likelihood of robot malfunction. While it is 
obvious that a high model discrepancy increases the risk 
of robot malfunction, it is very difficult to calculate the 
probability of robot malfunction as a function of the model 
discrepancy. The reason for this is that in order to calculate 
the discrepancy the model state must be compared to the 
world state, which contains an infinite number of ele- 
ments. An approach to this problem is to introduce a 
threshold value for each perception, representing the mini- 
mum permitted information quality of the perception. 
After each executed action during the execution of an 
action sequence, the robot checks that each perception has 
a quality exceeding the corresponding information quality 
threshold. If this is not the case, a sensing action is exe- 
cuted to increase the information quality of the percep- 
tions. Otherwise, the next action in the sequence is 
executed. 

2 Model - world duality 

The correspondence between the model and the real-world 
is established through the following definition. 

Definition 2-1 To find a solution to a real-world problem is 
analogous to the abstract problem of finding a path, p, sat- 
isfying the predicate Q(p), from an initial state to a goal 
state in a model state space, S. 
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For instance, Q(.) might represent the predicate “short- 
est.)”, “least expensive(.)”, or “most safe(.)”. 

Although Def. 2-1 is rather general, a straightforward 
interpretation within the framework of this paper is possi- 
ble, The problem space, S, corresponds to the space of 
possible model states. The path, p, corresponds to a 
sequence of actions, while Q(t) is interpreted as “Informa- 
tion Quality high enough along the path”. 

Ideally, the abstract plan is in perfect agreement with the 
solution to the real-world problem. In practice, though, 
this is rarely the case and the robot must perform plan val- 
idation iteratively during the execution. The efficiency in 
the detection of plan failures and the subsequent re-plan- 
ning is closely related to the performance of the robot, and 
could accordingly be taken as a measure of the same. 

Each cycle of processing in FigJ corresponds to the tran- 
sition between two states in the problem space - ideally 
towards a goal state. 

This duality between the cyclic processing of the physical 
system and the problem space transitions suggests the pos- 
sibility to analyze plans in the problem space before exe- 
cuting them on the physical system. 

3 C on ce pts 

One-agent applications 

A large class of robotic applications are one-agent applica- 
tions. In a one-agent application, it is assumed that all 
changes to the system state are caused by the actions of the 
sole agent. Consequently, for one-agent applications it is 
straightforward to formalize the dependency between 
actions and perceptions of an agent. 

World 

The world is composed of two entities, agent and environ- 
ment. This is in accordance with the one-agent application 
assumption. The agent may correspond to either an auton- 
omous or a tele-operated robot. 

Sensing 

In order to gather information about the world, the agent 
must use sensors. Sensors measure either the state of the 
agent or the state of the environment. Sensing, a, maps the 
world state to perceptions, which in turn can be mapped to 
a more abstract perception, or be combined with previous 
perceptions to give a more accurate perception. 


Perception 

Perceptions provide descriptions of details of the world. 
To distinguish between perceptions with different proper- 
ties, perceptual classes are introduced. Within each per- 
ceptual class, perceptions are distinguished by their 
creation time. Accordingly, a perception from perceptual 
class z, created at time k will be denoted Pj^. Perceptions 
acquired by the agent up to time k, where ke N /is repre- 
sented by the perception vector P|^ 

Pk = (PvPz’ —>Pr) w 

where r is the number of perceptual classes and p, corre- 
sponds to a set of acquired perceptions of the z.th percep- 
tual class, that is 

Pi - ^Pc,k-k^’ Pc,k-k^> ■•■’Pc,k-k,^ 

where 0<k^< ... <k 2 <k^<k ■ 

Perceptions are created either by using data from one or 
more sensors or by combining previous perceptions into a 
more abstract or more accurate perception. To create more 
abstract perceptions, feature extraction algorithms are 
applied, while to create more accurate perceptions spatio- 
temporal dependencies among the previous perceptions 
are exploited. 

Action 

The set of actions that the agent can execute is denoted A. 
Actions, a, € A , are used for changing the world state. 
Actions can be identified as being of either manipulatory, 
navigational, or sensing type. Accordingly, three action 
classes are introduced. Actions from the manipulatory 
action class change the state of the manipulator, which in 
some cases also changes the state of the environment. 
Actions from the navigational action class correspond to 
the movement of the agent to a new location. Sensing 
actions correspond to the acquisition of information about 
the world state. 

Reasoning 

Given perceptions of the world state, the agent performs 
reasoning, p, to determine what action sequence to exe- 
cute. 
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Effectuating 

The robot is using effectors to translate actions into 
changes of ilie world state. Effectuating, £, therefore maps 
actions to world state changes. 

Functional description of an agent 

In a one-agent application, it is possible to describe per- 
ceptions as functions of actions or vice versa. This is illus- 
trated in Fig. 2. 



Fig. 2. Functional description of an agent 

Following is the interpretation of Fig. 2: Perceptions, P, are 
mapped by reasoning, p, to an appropriate action sequence 
(A)". The effectuating, 8, maps each action to a new world 
state, W. Sensing, a, maps the world state to perceptions, 
P,. 

Let Cjt denote k:th reasoning-effectuating-sensing cycle, 
that is 

(3) has the following inteq^retation: 

By reasoning, the robot decides to execute an action 
sequence consisting of actions, which are send to the 
effectuating, e. Thereafter the robot uses sensing, a^, to 
acquire information about the resulting world state. 
Accordingly, the k:th perception can be written as: 

Pk = q(P,.,) =qq_,...c,(Po) (4) 


In applications where the agent corresponds to an autono- 
mous robot it is possible to describe the mapping (p) 
explicitly by a mathematical function. Accordingly, it is 
possible to, given a perception vector Pj and the mapping 
p, determine what action sequence will be executed conse- 
quently. This is possible since machine reasoning consists 
of well-defined deterministic operations. 

For applications where the agent corresponds to a tele- 
operated robot (or where man/machine cooperative deci- 
sion making^^ is used), the value of the mapping, p, corre- 
sponds to the action sequence (partly) decided upon by the 
human expert. Since it is hardly possible to establish a 
deterministic model of the reasoning process involved, the 
true mapping function, p, is (partly) unknown for tele- 
operated robots. 

However, in either case, the impact of an action on the 
world state will be the same. This implies that the percep- 
tions dependency on actions is invariant under different 
reasoning approaches. Consequently, this suggests that the 
same principles for maintaining the information quality 
can be utilized for both agent types. 

4 Information quality 

Having introduced a set of perceptual classes, a measure 
of the information quality of each perception is needed. 
The measure should reflect both the accuracy and rele- 
vance of a perception. External perceptions are commonly 
maintained at three abstraction levels to reduce complex- 
ity and enable inference. The lowest level contains numer- 
ical, the second geometric, and the third symbolic 
information respectively^. Consequently, the information 
quality measures at two distinct abstraction levels will dif- 
fer. In this paper, we will consider only the two lowest 
abstraction levels. Internal perceptions, describing the 
state of the robot, are often of limited complexity. Hence, 
only a numerical model is needed. For example, if ignor- 
ing dynamical properties, the state of a mobile robot 
equipped with a manipulator arm can be described by a 9- 
dim. vector (3 dim. for describing the robot pose and 6 
dim. for describing the position and orientation of the end 
effector). 

Accuracy 

Perceptual accuracy is a static attribute that is determined 
when the perception is created. It is static because no 
future event can affect the accuracy of an already made 
measurement. If perceptions are treated as vectors, one 
common way to represent accuracy for numerical percep- 
tions are by the corresponding covariance matrices. In this 
way, Kalman filtering techniques can be utilized to gener- 
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ate accurate state estimates. A classical paper on this is the 
one by Smith, Self, and Cheeseman^^ which provides a 
framework for handling robot positional uncertainty, and 
may be extended to also handle robot arm positional 
uncertainty. 

Relevance 

Besides being more or less accurate, a perception will also 
be more or less relevant. For example, a temperature mea- 
surement from one part of a building tells very little about 
the temperatures in other parts of the building, although 
the temperature was measured with high accuracy. Fur- 
thermore, after some time that particular temperature mea- 
surement tells very little about the current temperature 
even in the position where it was obtained. These two facts 
correspond to the limited spatial and temporal applicabil- 
ity range of perceptions. Thirdly, assume that the agent 
that did the previous temperature measurement decides to 
open a window. Provided it is a temperature difference 
between the inside and outside of the building, this action 
will reduce the relevance of the previous measurement. 
Thus, manipulatory actions executed by the agent is 
another cause of variation in relevance. 

While the first and third example illustrate the dependency 
of the relevance on the executed navigational or manipula- 
tory actions of the agent respectively, the second example 
illustrates that the application is not an ideal one-agent 
application. Since no real-world application is an ideal 
one-agent application, a perception aging function, monot- 
onously decreasing with time, must be used. 

The presented examples describe relevance for a numeri- 
cal perception. In Section 7 a relevance measure for geo- 
metric perceptions, based on the maximum entropy 
principle, will be described. 

Information Quality measure 

Our approach assumes a bidirectional dependency 
between perceptions and actions. This is an extension to 
the approach by Erdmann®, who assumes a unidirectional 
perception/action dependency. 

Section 3 introduced a description of the k:th perception as 
the result after a reasoning-effectuating-sensing sequence 
has been executed after an initial perception Pq has been 
created. 

The information quality of a perception vector P|^ is 
denoted QI(Pk). The evaluation of the information quality 
is as 


QI(P^) = ?r(Pr)) (5) 

where qi(pj) is evaluated as 

QiiPi) = max 

where k^ is iterated over all creation times of the percep- 
tions in the i:th perceptual class. 

Using (3) and (4), (5) can be rewritten as 

QHP,) = (7) 

The index n|^ must be selected as to satisfy the condition 

J > t ) a Ql(o*e"‘* ' P A- J < ^ («) 

is satisfied, where T is the information quality threshold 
value. This condition means that executing nj^^ actions in 
a sequence will result in an information quality value 
below the threshold value. Since robot malfunction corre- 
sponds to the failure to execute a particular action, the 
threshold value must take into account that for some 
actions an action failure is harmless while for irreversible 
hazardous actions a successful action execution is essen- 
tial. Thus, to permit the execution of an action, the infor- 
mation quality of the perceptions must exceed its 
corresponding threshold value, T. 

Predictions 

Knowledge about the statistical properties of the effectors 
enables the prediction of the environmental state resulting 
from an executed action. Furthermore, with information 
about the statistical properties of the sensors, it is possible 
to predict how the resulting environmental state should be 
perceived if a new perception where obtained. 

Given a perception and an action to execute, it is possible 
to predict what should be perceived after the action has 
been executed. The uncertainty in this prediction is deter- 
mined by the statistical properties of the stochastic map- 
ping. If this uncertainty is considered to be low enough, an 
additional action may be executed, with a corresponding 
new prediction. The uncertainty in this new prediction is 
higher than for the previous prediction. This can be iter- 
ated as long as the uncertainty in the prediction is low 
enough. When, at last, the prediction will be too uncertain, 
a new perception must be generated. 

The agent can predict the true world state resulting after 
each executed action by using knowledge about the most 
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probable outcome of the action. The resulting prediction 
of the world state after n^ actions (after time k) have been 
executed is denoted Pf^ a- Thus, 

Pk.a = e"'pA-i W 

The information quality of the prediction is denoted 
QI(^*,a). that is 

QI(Pk.,) = (10) 

where n^ must satisfy the condition 

[Ql[e"‘PtP*. J > rj A ‘ pA-i j < 7'j (11) 

The evaluation of (11) is done in a way similar to (6). 

5 Sensor planing 

Both autonomous and tele-operated robots require a crite- 
rion for when to acquire new information. According to 
the previous discussion this is necessary when the quality 
of the information is so low that the reasoner is likely to 
draw the wrong conclusions about the situation. This cor- 
responds to some perception being too inaccurate or too 
irrelevant. A sensor planning algorithm should keep track 
of the resulting accuracy and relevance of the perceptions 
as actions are executed, and enforce a sensing action if the 
accuracy or relevance has become too low. 

In sensor planning, an important difference between 
autonomous and tele-operated robots is their type of rea- 
soning (p). For autonomous robots, a well-defined map- 
ping from perceptions to actions is used, while for tele- 
operated robots, a human expert decides upon what action 
to execute next. In effect, autonomous robots may use (8), 
where the resulting information qualities are calculated 
before the sequence is executed. For tele-operated robots, 
the situation is different. Here, the sensor planning algo- 
rithm has no knowledge about what action will be exe- 
cuted next. Therefore, it must keep track of the infor- 
mation quality values during operation and stop the robot 
if the information quality has become too low. This 
implies that (11) should be used instead. 

6 Maximum entropy methods 

The information theoretical entropy concept has been 
applied in a variety of scientific disciplines. For a survey 
of entropy optimization techniques, we refer to Kapur and 
Kesavan^'^. In robotics, Saridis^®’ and Valavanis^"^ 


have described robot systems that use the concept of 
entropy as a global performance measure. In their 
approach, optimal robot behavior is achieved through the 
minimization of the total system entropy. Sanderson^^ 
uses the entropy concept to describe the complexity of dif- 
ferently shaped geometrical objects, measured in bits. 
Finally, the thorough study of relations between informa- 
tion theory and search theory conducted by Pierce^^ has 
inspired the development of the relevance measure for the 
case study described in Section 7. 

1 Case Study - Sensor planning on a tele^ooerated 

indoor in terve n tio n r pb Qt 

This case study demonstrates how the previously intro- 
duced concepts can be instantiated in a real-world applica- 
tion for a mobile robot equipped with a manipulator arm. 
The robot obtains information about the external state by 
using laser and video cameras. Information about the 
internal state is obtained through odometry and angle 
counters on the joints on the manipulator arm. This robot 
type is very general, but by constraining either the mobil- 
ity or the manipulability capabilities, more restricted robot 
types are obtained. In the case study, a representative set 
of perception and action classes are introduced. Further- 
more, to properly control the system, the robot must have 
a system model with low discrepancy. Because of limited 
computational power, the model must have as low com- 
plexity as possible. In order to establish a compact but yet 
useful model, it is important to exploit structure in the 
robot operations. In the case study, a composite 2D/3D 
model developed for the discussed robot type is elabo- 
rated. 

Robot operation 

During operation, the state changing actions executed by 
the robot can be classified as being either navigational or 
manipulatory. Navigation corresponds to the movement of 
the robot to a new location, while manipulation corre- 
sponds to the reorientation of the manipulator arm (the 
purpose of the manipulator arm is to change the state of 
the environment). This suggests the introduction of the 
two action classes Ajsj and A^^, denoting the navigational 
and manipulatory action class respectively. When the 
robot executes a sequence of navigational actions, it is said 
to be in navigational mode. Similarly, when the robot exe- 
cutes a sequence of manipulatory actions, it is said to be in 
manipulatory mode. 

Assuming a one-agent application, the world consists of 
two entities, agent and environment respectively. The state 
of the agent will be denoted the internal state while the 
slate of the environment will be referred to as the external 
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state. Accordingly, measurements of the internal state will 
be referred to as gauging, reflecting the contact-type of 
internal measurements, while measurements of the exter- 
nal state will be referred to as sensing. 

This gauging/sensing action class division is natural. First, 
the division conforms to the two-mode robot operation 
since only the internal state changes when in navigational 
mode. Second, the complexity of the internal state descrip- 
tion is much lower than the complexity of the external 
state description. Consequently, a division is necessary of 
the external state description into representations at differ- 
ent abstraction levels. As a comparison, it is possible to 
establish a basic description of the internal state using a 9- 
dim. vector, while sensor data of the external state may 
well contain 10,000 data points. 

The introduced four action classes is listed in (12): 

A^, A^, Aj^} (12) 

with the following respective interpretation: 

• As corresponds to Sensing actions, which measure the 
external state. 

• A <3 corresponds to Gauging actions, which measure the 
internal state. 

• An corresponds to Navigational actions, which change 
the global state. 

• Aj^ corresponds to Manipulatory actions, which 
change the local state. 

Pe rceptual classes 

Having introduced action classes in the previous section, 
this section presents a step-wise partitioning of the world 
description into a set of perceptual classes. 

Perceptions describe different aspects of the world and 
thus represents a world model. Typically, the data from 
one or more sensor is refined and transformed into a per- 
ception (a perception is similar to virtual sensor"^ and logic 
sensor^^ respectively). In turn, a perception may be used 
to generate a more abstract perception, or be combined 
with previous perceptions to generate a perception with 
higher accuracy. 

In the previous section, the two information acquiring 
action classes Aq and Ag was introduced. The correspond- 
ing perceptions resulting from information acquiring 
actions from respective action class are denoted internal 
and external perceptions respectively. As mentioned, the 
mobile robot can be viewed as being in either a naviga- 


tional or a manipulatory operation mode. This observation 
suggests the partitioning of the perceptions into global vs. 
local perceptions. Global perceptions are perceptions that 
provide vital information when in navigational mode, 
while local perceptions are perceptions that provide vital 
information when in manipulatory mode. This leads to the 
division of the perceptions into four perceptual classes 
(Table 7-1). 



Lx)cal 

Global 

Internal 

7al 


External 

llA 

^E.G 


TABLE 7-1 Perceptual Classes 


For the robot at hand, examples of perceptions belonging 
to each perceptual class are suggested below: 

• Pj The pose of the robot arm 

• Pj qI The pose of the robot 

• Pel- ^ manipulable object 

• P^ q: The spatial description of the building 

2D representations of the global perceptions P^g and Pjg 
suffice in most applications since the robot moves on 
almost flat surfaces and detected obstacles may be 
assumed to have infinite height. Having infinite height 
implies that it suffice to describe their projections on the 
2D plane. Thus, navigational actions are described within 
a 2D model, where the pose description contains three 
parameters (two for position and one for orientation). 

General manipulatory actions involve manipulation of 
objects arbitrarily oriented in 3D space. Since the range of 
the robot arm is limited, a 3D model will be reasonable. 
For each object that is to be manipulated, a 3D description 
of its closest environment is used. The local 3D descrip- 
tions are connected to the global (navigational) 2D model, 
thus providing a composite 2D/3D model. 

Accuracy 

Smith, Self, and Cheeseman’s formalism^^ may be applied 
for estimating the internal state, consisting of the robot 
pose (x, y, (p) , where x, y describe the position and (p the 
orientation of the robot, and of the position and orientation 
of the robot arm (x, y, z, (|>, 9, \\f) , where x, y, z corre- 
sponds to the position and (p, 9, \p corresponds to the ori- 
entation of the end effector using euler angels. The 
introduced information quality threshold value, T, will in 
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this case correspond to a matrix where the elements should 
not be exceeded by the corresponding elements in the 
covariance matrix for the 9-dim. vector describing the 
compound internal state. 

Relevance 

The relevance of a perception is affected by spatial and 
temporal factors. While the former concerns the topology 
of the environment, the latter provides means for compen- 
sating for the one-agent assumption in applications actu- 
ally containing several agents. Typically, the aging of the 
perceptions is modelled by simply scaling the perceptions 
with a monotonously decreasing weight function. The spa- 
tial factors affecting the relevance of perceptions of the 
global external state will be elaborated below, yielding a 
relevance measure applicable to laser range data*, 
although it may be modified to be used for sonar range 
data instead. 

An assumption being made is that the gathering of spatial 
information is costly in time and resources. Therefore, 
new spatial information should be acquired only when 
absolutely necessary. To determine when this is the case, a 
measure indicating the relevance of a perception is 
needed. 

Consider the situation in Fig.3. Spatial information 
obtained from position pi will lack information about the 
region N.E. of p2. The sector indicated in the figure will 
accordingly be referred to as a hidden sector. The hidden 
sector indicates that the spatial information from pi is not 
as relevant for describing the environment of p 2 as it is for 
describing the environment of pi. If the only spatial infor- 
mation available is the one from pi, it is impossible to tell 
whether the spatial information not contained in pi is 
important or not. Let R denote the statement “Relevant 
information” and — >R the statement “Not relevant informa- 
tion” (e.g, in search operations, indications of the object 
being searched for are considered relevant). Letting pj(R) 
denote the probability that scan direction i contains rele- 
vant information, then Laplace’s principle of insufficient 
reason states that Pj(R) = Pi(-»’R) = 1/2 if no prior informa- 
tion is available. 


* Using range data from either laser or sonar is a common 
way to build up maps of indoor environments^ ’ ^ 



Fig. 3. Relevance of spatial information, hidden sector 

Assume that the laser measures range in M consecutive 
directions in a horizontal plane. Then M probability func- 
tions are introduced. It is assumed that after a measure- 
ment has been made, it is clear whether a particular 
direction contains relevant information or not. Thus, after 
a measurement p. (R) e {0, 1} . For large M, the angle 
resolution will be high enough to justify the assumption 
that no new information will come out of an additional 
measurement of a previously scanned region. 

In the case of Fig.3 this implies that the only new informa- 
tion coming out of a measurement from position p 2 is 
information about the region that is not described by the 
measurement from pi. Assuming the angle between con- 
secutive scan directions to be A 9 and that the hidden sec- 
tor has angle a, a scan from p 2 will provide new 
information from d = a/Aq> directions. Letting p and q 
denote the ensemble of probability functions correspond- 
ing to the measurements from p 2 and pi respectively, then 
the amount of new information obtainable from p 2 given 
the information from pi is 

d 

H(p\q) = -^ O’) log {p. 0) ) (13) 

» = 1 J 

where j e {/?, -n/?} . Using the maximum entropy princi- 
ple, the sum in (13) is equal to d • log 2 , which is the 
amount of new information obtained if scanning from 
position p 2 , or equivalently stated the amount of informa- 
tion about p 2 :s environment not included in the spatial 
description from pi. As is seen, the obtained information 
is proportional to the width of the hidden sector. 

The amount of information obtained from the initial scan 
is M ■ Iog2 . No later scan will give that much informa- 
tion. This suggests that the information quality thresholds 
for the external perceptions should belong to the interval 
[ 0 , M •log 2 ]. 
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In the general case there are k hidden sectors, correspond- 
ing to k sub-intervals in the interval 1,2,..., M that provide 
new information (see Fig A). 



• • • 


“j 

• • • 

pi 

• it 



• • • 


Fig. 4. k hidden sectors 

Extending (13) to cover k hidden sectors yields 

k 

H{-p\q\) = (14) 

A = 1 / = 1 J 

which, according to the maximum entropy assumption is 
equal to 

k 

log 2 • (IS) 

h = 1 

Notice that if the laser range data is segmented by using 
polyline segmentation^, the hidden sectors can be calcu- 
lated directly, since the information needed to calculate the 
hidden sectors is information about the coordinates of the 
endpoints of the line segments. 

Information quality 

As mentioned, there is a close connection between percep- 
tions and actions in one-agent applications. This section 
describes this dependency. fig.S captures the dependency 
between the action and perception classes. The actions 
positive/negative influence on the information qualities of 
the perceptions are indicated by the or superscript. 



Fig. 5. Perception/Action class dependency 

In FigJ, the dependency within a pair of perception/action 
classes is bidirectional. The information qualities of the 
perceptions are either increased or decreased depending on 
what actions that have been executed. A similar depen- 
dency is present in the opposite direction since the infor- 
mation qualities of the perceptions must be sufficiently 
high in order for the reasoner to perceive the situation cor- 
rectly and select proper actions. In FigJ, the perceptual 
classes have the following interpretations: 

• Internal global state (Pig) corresponds to the position 
and orientation of the robot, and is measured by odom- 
etry. 

• Internal local state (Pj^) corresponds to the position 
and orientation of the end effector (robot hand), and is 
measured by combining measures from angle counters 
on the joints of the robot arm. 

• External global state (Peg) corresponds to a line seg- 
ment description of the robot environment. The 
description is created by line segmentation^ of laser 
data, and is measured by a laser range finder. 

• External local state (Pel) is measured by vision (TV 
camera). 

For navigation tasks, the operator uses the line segment 
description of the robot environment, while for manipula- 
tory tasks views from TV cameras are used. The percep- 
tion/action dependency may then be illustrated by the 
dependency graph in Fig.6, which emphasizes the two- 
mode operation. The nodes in the leftmost column corre- 
spond to sensors, the nodes in the middle column corre- 
spond to perceptions, and the nodes in the rightmost 
column correspond to actions. 
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Fig. 6. Information quality dependency graph 

Odometry (O) gauges the internal global state, the descrip- 
tion of which is stored in Pj < 3 . New measurements cannot 
reduce the increasing positional uncertainty as the robot 
moves, although Kalman filtering techniques can reduce 
this problem. For this reason, the arrow from 0 to Pj g in 
Fig.6 is unlabeled. Laser (L) senses the external global 
state, the description of which is stored in Pg g* Acquiring 
new laser data improves the relevance of Pe g^ indicated 
by a “+”-sign on the arrow between L and P^ g in Fig.6, 
Angle counters (A) gauge the internal local state, the 
description of which is stored in Pj l- Although new mea- 
surements cannot reduce the uncertainty in Pj l, the uncer- 
tainty will not increase monotonously as the uncertainty 
corresponds to lashes in the joints. Vision (V) senses the 
external local state, the description of which is stored in 
Pe.L- 

In this application, the external local states corresponds to 
local descriptions of particularly interesting regions in the 
environment (e.g. regions containing manipulatable 
objects). The relevance measure introduced in Section 4 is 
intended for external local perceptions (which are 2D). A 
similar measure for the external local perceptions (which 
are 3D) may be developed based on the concept of hidden 
cones although this is not described in this paper. 

Navigational actions, corresponds to the transport of 
the robot to a new position in the environment. The accu- 
racy of the internal global perception Pj g is decreased due 
to accumulation of positional uncertainty. Also, the rele- 
vance of the external global perception is decreased, in 
accordance with the fact that the applicability of the exter- 
nal global perception may decrease as the robot moves 
away from the position at which the external global per- 
ception was generated. 

Manipulatory actions, A^, corresponds to the movement 
of the manipulator arm, which may affect the accuracy of 
the internal local perception and/or the relevance of the 
external local perception. 


Finally, as mentioned, the increased uncertainty in the esti- 
mation in the robot position is impossible to eliminate by 
solely using odometry. However, it is possible to combine 
sensing and gauging actions to improve the information 
quality of Pi g-^- 

Platform evaluation 

Having a method for maintaining the information quality 
at an acceptable level, the problem arises what threshold 
values to use. By solving a task, typical for the application 
at hand, with distinct information quality threshold value 
settings, the setting providing the best trade-off between 
safety and speed should be used. To compare different 
robot configurations (i.e., using different sensors and 
effectors), the optimal parameter setting for each robot is 
determined whereafter their optimal performances are 
compared. If an information quality value of a perception 
could be held close to zero without affecting the perfor- 
mance, this suggests that the corresponding perception is 
of little use in the application and might be omitted. By 
assigning each action (a) a cost C(a), for example corre- 
sponding to execution lime, the cost to use a particular 
platform is easily obtained by summing the cost of all 
actions in the action sequence that was executed during 
the mission. This value could then be combined with other 
factors, such as cost of sensors, number of errors during 
the mission etc., in a platform evaluation. Below, two plat- 
forms are described, one open-loop (without sensors) and 
one closed-loop (with sensors). 

Open-loop robot platform 

The mission cost (C^) for this system is expressed as 
Cm= 2 

ze {N,M} i 

As is seen, no sensing actions are executed, which corre- 
sponds to setting the information quality thresholds to 
zero. This system is appropriate only in a highly structured 
world. 

Closed-loop robot platform 

The mission cost in this case is expressed as 

z€ i 

By repeating the solution of a task with different informa- 
tion quality threshold values, a compromise between 
safety and speed can be found. If a low information quality 
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threshold value provides fairly high safety, this suggests 

that the corresponding perception is of little use and may 

be omitted. 
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Abstract 

The Network Data Delivery Service (NDDS) is a novel 
network data-sharing system. The NDDS system builds 
on the model of information producers (sources) and con- 
sumers (sinks). Producers register a set of data instances 
that they will produce^ unaware of prospective consumers 
and “produce” the data at their own discretion. Con- 
sumers “subscribe” to updates of any data instances they 
require without concern for who is producing them. The 
routing protocol is connectionless and nearly “stateless”; 
all data producer and consumer information is dynam- 
ically maintained. Thus network reconfigurations, node 
failures, etc. are handled naturally. 

This scheme is particularly effective for systems (such 
as distributed control systems) where information is of a 
repetitive nature. NDDS features the ability to handle 
multiple producers, consumer update guarantees, notifi- 
cations vs. polling for updates, dynamic binding of pro- 
ducers and consumers, user-defined data types, and more. 
NDDS is integrated into the ControlShell real-time frame- 
work, and is being used in several robotics applications as 
an effective means of information sharing between sensor 
systems, robot controllers, planners, graphical user inter- 
faces, simulators etc. 

This paper describes the philosophies behind the NDDS 
system, and details an example application of a dual-arm 
robotic system capable of planning and executing complex 
actions under control of an interactive user interface. 

1 Introduction 

Many control systems are naturally distributed. This is 
due to the fact that often they are composed of several 
physically distributed modules: sensors, command, con- 
trol and monitoring modules. In order to achieve a com- 
mon task, these modules need to share timely information. 
Robotic systems are a prime example of such distributed 
control systems. 


Stan Schneider 

Real-Time Innovations, Inc. 

954 Aster 

Sunnyvale, California 94086 

These information sharing needs are common to many 
other application environments such as databases, dis- 
tributed computing, parallel computing, transaction sys- 
tems, etc. However, distributed control applications have 
some unique requirements and characteristics: 

♦ Data transactions in control applications are often 
time-critical. To be useful for control purposes data 
must get from its source to its destination with min- 
imum delay. 

♦ There is often the need to synchronize computation 
to the arrival of new data. For example the control 
command may need to be updated only when new 
sensor data arrives. A collision-avoidance plan may 
need to be re-evaluated when a new obstacle is de- 
tected, etc. 

♦ A significant portion of the data flow is repetitive in 
nature. This is true of sensor readings, motor com- 
mands etc. For this type of data, data loss is often not 
critical: sending data is an idempotent operation and 
new updates just replace old values. This property 
suggests that considerable overhead can be avoided 
by using a data transfer paradigm that exploits these 
facts. 

♦ There are often multiple sources of (what may be 
considered) the same data item. For example, a robot 
command might be generated by a planner module 
as well as a tele-operation module. In the same way 
there can be many data consumers. A robot and a 
simulator are both sinks of “command-data.” The 
network of data producers and consumers may not 
be known in advance and may change dynamically. 

♦ Data requirements are ubiquitous and unpredictable. 
It is often very difficult to know what data will be re- 
quired by other modules. For instance, force-level 
measurements — normally used only by a low-level 
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controller — may be required by a sophisticated high- 
level task planner in the future. The architecture 
should support these types of data flow. Thus, vital 
data should be accessible throughout the system. 

• Most data flow can and should be anonymous. Pro- 
ducers of the sensor readings can usually be unaware 
of who is reading them. Consumers may not care 
where the data they use came from. Since it is not 
essential, hiding this information increases modular- 
ity by allowing the data sources and sinks to change 
transparently. 

These requirements are sufficiently unique to deserve 
special treatment. To address all this needs we have de- 
veloped the Network Data Delivery Service (NDDS). 

NDDS provides transparent network connectivity and 
data ubiquity to a set of processes possibly running in 
different machines, NDDS allows distributed processes to 
share data and event information without concern for the 
actual physical location and architecture of their peers^ . 
NDDS allows its “clients” to share data in two ways: sub- 
scriptions and one-time queries, 

NDDS supports “subscriptions” as a fundamental 
means of communication. In the application context de- 
scribed, subscriptions have some fundamental advantages 
over other information sharing models (such as client- 
server or shared- memory). Subscriptions cut in half the 
data latency over query/response type models and it al- 
lows synchronization on the latest available information 
as soon as it is produced. 

NDDS supports multiple information sources (produc- 
ers) and users (consumers). It provides clear semantics for 
multiple-producer conflict resolution, provides support for 
and guarantees multiple update rates (as specified by the 
consumers), and uses decaying state at all levels to ensure 
inherently robust communication. 

NDDS is into the ControlShell real-time programming 
framework [4, 7] and is being been used in several appli- 
cations including the control of a two-armed robotic sys- 
tem [6], an underwater vehicle [10], and a self contained, 
two-armed space robot originally described in [9] . 

2 Relation to Other Research 

There is a large body of literature covering information 
sharing in distributed computer systems [1, 2]. More re- 
cently new schemes have been developed to address the 
specific needs of distributed control applications (see ci- 
tations below). 

^ NDDS is being used to communicate between Sun, HP and DEC 
workstations sts well eis VME-based real-time processors running the 
VxWorks operating system. 


Several issues are relevant when comparing different 
data-sharing approaches. 

• Abstraction presented to the user. How natural and 
appropriate is it for the specific domain of distributed 
robotic systems? 

• Robustness. Recovery from computer and communi- 
cation failures. 

• Flexibility and Expandability. How easy is it to 
add/replace component modules. Can it be done dy- 
namically^ while the system is in operation? 

In the last few years, several distributed data-sharing 
schemes have been developed to address many of the is- 
sues raised in the introduction among these MBARFs 
Data Manager [5], CMU’s TCX [3], Rice University’s 
TelRIP [11] and Sparta’s ARTSE [8] all offer network- 
transparent connectivity across different platforms and 
support subscriptions as means of communication where 
multiple consumers can get updates from a single pro- 
ducer. Of these, only the Data Manager, provides sup- 
port for multiple (consumer-specified) update rates. And 
only TelRIP supports multiple producers of a single data 
item. None of the above architectures combine the above 
facilities with NDDS’s fully-distributed, symmetrical im- 
plementation (absence of privileged nodes) nor use a re- 
startable handshake-free stateless protocol. 

3 The NDDS Communication 
Model 

The NDDS system builds on the model of information 
producers (sources) and consumers (sinks). 

Producers register a set of data instances that they 
will produce, unaware of prospective consumers and “pro- 
duce” the data at their own discretion. Consumers “sub- 
scribe” to updates of any data instances they require 
without concern for who is producing them. In this 
sense the NDDS is a “subscription-bcised” model. The 
use of subscriptions drastically reduces the overhead re- 
quired by a client-server architecture; Occasional sub- 
scription requests, at low bandwidth, replace numerous 
high-bandwidth client requests. Latency is also reduced, 
as the outgoing request message time is eliminated. 

NDDS identifies data instances by a name (the NDDS 
name). The scope of this name extends to all the tasks 
sharing data through NDDS. Two instances with the same 
NDDS name are viewed by NDDS as different updates of 
the same data instance and are otherwise indistinguish- 
able to the client. If two data instances must be distin- 
guished by any NDDS client, they must be given a differ- 
ent NDDS name. 
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Function 

Action 

N ddsRegister Producer 

Register and specify producer 
parameters 

NddsProduce 

Add an instance produced by 
the producer 

NddsSampleProducer 

Take a snap-shot of all the in- 
stances produced by the pro- 
ducer. For immediate produc- 
ers also send updates to all 
consumers. 

NddsRegisterConsumer 

Register and specify consumer 
parameters 

NddsSubscribeTo 

Add a subscription to a con- 
sumer. Specify a call-back rou- 
tine to be called on updates 

NddsReceiveUpdates 

Poll the consumer for updates. 
Will result on call-back rou- 
tines being called when appli- 
cable. Required only of polled 
consumers. 

NddsQuerylnstance 

Issue a one-time query. 


Table 1: Functional interface to produce, consume 
and query data. 

Producing data, involves three phases: Registering (declar- 
ing) a producer f declaring the instances the producer will 
produce and sampling the producer. Receiving data up- 
dates also involves three phases: Registering (declaring) a 
consumer, declaring the instances that the consumer sub- 
scribes to along which the action to be taken and lastly 
receiving the updates. The last phase is only required for 
polled consumers. 


NDDS requires all data instances to be of a known type. 
NDDS has some built in types (such as strings and arrays) 
but most data flow consists of user-defined types. Creat- 
ing a new NDDS type involves binding a new type-name 
with the functions that will allow NDDS to manipulate 
instances of that type. 

NDDS treats producers and consumers symmetrically. 
Each node maintains the information required to estab- 
lish communications. Producers inform prospective con- 
sumers of the data they produce. Consumers use this 
information to either subscribe to data or issue one-time 
queries. Table 1 lists the steps involved in becoming a 
producer or consumer of data. 

3.1 Producer Characteristics 

A producer can be compared to a multi-channel Sample- 
and-Hold. It is associated with a set of object instances 


(similar to the signal channels) that get sampled syn- 
chronously. Sampling a producer takes a sample of the 
values of each data item the producer has associated with 
it. The data is either immediately distributed (for imme- 
diate producers) or saved for later distribution (delayed 
producers). 


time 

Accept only 
updates of 
higher strength 

Accept 
any update 




data update 

persistence 



Figure 1: Multiple producer conflict resolution. 

NDDS resolves the multiple-producer conflict by charac- 
terizing each producer with two properties: the producer's 
strength and its persistence. When a data update is re- 
ceived for some object instance, it is accepted if either 
the producer's strength is greater (or equal) to that of the 
producer of the last update for that instance or, the time 
elapsed since the last update was received exceeds the per- 
sistence of the producer of the last update. In essence the 
strength is like a priority and the persistence is the duration 
for which the priority is valid. 

A producer is characterized by three parameters: its 
production rate, its strength and its persistence. The 
strength and persistence parameters are used to resolve 
multiple-producer conflicts. Their meaning is illustrated 
in Figure 1. A producer’s data is used while it is the 
strongest source that hasn’t exceeded its persistence. 
Typically a producer that will generate data updates ev- 
ery period of length T, will set its persistence to some 
time Tp where Tp > T. Thus, while that producer is func- 
tional, it will take precedence over any producers of less 
strength. Should the producer stop distributing its data 
(willingly or due to a failure), other producers will take 
over after Tp elapses. This mechanism establishes an in- 
herently robust, quasi-stateless communications channel 
between the strongest producer of an instance and all the 
consumers of that instance. 

3.2 Consumer Characteristics 

Consumers are notification based. They subscribe to 
a set of instances (identified by their NDDS name) by 
providing call-back functions for each instance they sub- 
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Figure 2: Consumer notification rate. 

The NDDS characterizes consumers requests for periodic 
updates with two properties: the consumer's minimum sep- 
aration time and its deadline. Once the consumer is called 
with an update for an object instance, it is guaranteed not 
to be notified again of the same instance for at least the 
minimum separation time. The deadline is a the maximum 
time the consumer is willing to wait for a new update. Even 
if new updates haven't arrived, the call-back routine will be 
called when the deadline expires. 
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Figure 3: One-Time Query Parameters. 

A one-time query specifies two parameters: the wait time 
and the deadline. A query will block for at least wait 
time during which the data arriving from the producer with 
higher strength is saved. At the end of the wait time the 
query returns if any data has been received. Otherwise, 
query remains blocked until either an update comes or the 
deadline expires (whichever comes first). 


scribe to. When a data update for a subscription arrives, 
the call-back function of every consumer is called with the 
data as a parameter. 

Two consumer models are supported: immediate and 
polled. An immediate consumer will be called back as soon 
as the data update arrives. A polled consumer will not be 
called back until it itself “polls” for updates. Consumers 
are characterized by two parameters, the minimum sepa- 
ration and the deadline (see figure 2). These parameters 
are used to regulate consumer update rates. Consumers 
are guaranteed updates no sooner than the minimum sep- 
aration time and no later than the deadline. Typically 
the minimum separation protects the consumer against 
producers that are too fast whereas the deadline provides 
a guaranteed call-back time that can be used to take ap- 
propriate action (the expiration of the deadline typically 
indicates lack of producers or communications failure). 

3.3 One-Time Queries 

A client task may issue one-time queries for specific 
NDDS data instances. Queries are blocking calls. Aside 
from specifying the name and type of the NDDS data 
instance, a query contains two parameters: the wait and 
deadline illustrated in figure 3. These parameters regulate 
the tradeoff between returning as soon as data becomes 
available and waiting for “better” data. The use of these 


parameters make the latency of this call predictable, al- 
lowing its use from real-time application code. Typically 
the wait is set to be long enough to account for commu- 
nication delays from all data producers to the consumer. 
The deadline provides a guaranteed call-back time in case 
no responses arrive. Setting a wait time to 0 causes the 
first response to be accepted. 

4 Implementation 

NDDS is symmetrically distributed, that is, there are no 
“special” or “privileged” nodes nor name servers. All 
NDDS nodes are functionally identical and each node 
maintains its own copy of the NDDS database and con- 
tains the helper processes necessary to implement the 
communication model described above. 

NDDS uses UDP as a means of communication. Data 
is encoded using XDR to allow communications between 
computers with different data representations. 

4.1 Architectural Overview 

An NDDS node is composed of one or more NDDS client 
processes (each with its respective NDDS Server Daemon) 
a copy of the NDDS database and three daemon (helper) 
processes that maintain the database and implement the 
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Figure 4: Communication between NDDS nodes. 

A NDDS node is composed of one or more NDDS clients and 
tliree helper daemons that share a local copy of the NDDS 
database. Each NDDS client has a private NDDS Server 
Daemon that informs other NDDS nodes of the productions 
and subscriptions of the client. The three helper daemons 
are responsible for maintaining the NDDS database, send- 
ing updates to remote subscribers, receiving updates and 
servicing queries. There is one NDDS Forwarding Daemon 
per Network host. 


NDDS communication model described above. See fig- 
ure 4. 

The user task becomes an NDDS client by linking to 
the NDDS library. Each NDDS client process spawns 
a private NDDS Server Daemon process that will assist 
in establishing the subscriptions and informing the peer 
nodes of the user productions. There is at most one NDDS 
node per address space so in operating systems that sup- 
port shared memory threads (for example VxWorks), sev- 
eral NDDS client processes may belong to the same node 
(sharing the same copy of the NDDS database and helper 
daemons^). 

The following is a functional description of the different 
daemons: 

• NDDS Forwarding Daemon (NFD). There is one 
NFD per network host. All the Request Receiver 
Daemons running on the host register with the NFD. 
Production notifications and subscription requests 

^In operating systems that don’t support shared memory 
threads, such as Unix, the helper daemons are not independent tasks 
but rather are installed as signal handlers. 


received by the NFD daemon are immediately for- 
warded to all the Request Receiver Daemon(s) run- 
ning on the host. 

• NDDS Server Daemon (NSD). Each NDDS client 
(user-task) spawns its private NSD. The NSD is re- 
sponsible for periodically informing the all the other 
NDDS nodes of both the subscription requests and 
the productions of the NDDS client. 

• Request Receiver Daemon (RRD). There is one 
RRD per NDDS node. The RRD is responsible 
for maintaining the remote subscriptions and pro- 
ductions in the NDDS database. Stale productions 
and subscription requests are aged and eventually 
dropped by the RRD. This daemon is also responsi- 
ble for replying to one-time queries from other NDDS 
nodes. 

• Update Sender Daemon (USD). There is one USD 
per NDDS node. The USD is responsible for sending 
the updates of locally produced data items to the 
subscribers in other NDDS nodes. This daemon also 
ensures that the the timing parameters requested by 
the consumer are met. 

• Update Receiver Daemon (URD). There is one 
URD per NDDS node. The URD is responsible for 
receiving updates for the local subscriptions of the 
nodes. The URD solves multiple-producer conflicts 
and, in the case of immediate consumers, executes 
the callback routine(s) installed for that data item. 
The URD also ensures that the timing parameters 
requested by the each consumer in the node are met. 

4.2 Data Management Overview 

The NDDS database is replicated and maintained on each 
NDDS node by three helper daemons (the Request Re- 
ceiver Daemon, Update Sender Daemon and Update Re- 
ceiver Daemon). The database stores and cross references 
producers and the data they produce, consumers and the 
data they consume, remote productions, subscriptions re- 
quested by both the NDDS clients in both the local and 
remote NDDS nodes, etc. 

Consistency between databases across different NDDS 
nodes is not necessary and requires no special effort. Tem- 
porary inconsistencies between databases may result on 
subscription requests (or queries) not reaching all the pro- 
ducers of a given data item and, as a consequence, dif- 
ferent nodes may get data from different producers. A 
similar situation may result from the data loss due to 
communication failure. At worst this will be a transient 
situation that arises only if there are multiple producers 
of the same data. 
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All information about remote NDDS nodes is aged and 
is eventually erased unless it is refreshed. The NDDS 
Server Daemon associated with each NDDS client is re- 
sponsible for the periodic refreshment of the information 
that concerns that NDDS client . This mechanism is in- 
herently robust to remote node failures, communication 
dropouts and network partitioning. Furthermore, it re- 
quires no special recovery mechanisms. 

5 Applications 

This section describes how a distributed robotic applica- 
tion exploits NDDS unique facilities to build a modular 
expandable system that integrates planning into a two- 
armed robotic workcell [6]. The system is composed of 
four major components: user interface, planner, the dual- 
arm robot control and sensor system, and an on-line simu- 
lator. The graphical user interface provides high-level user 
direction. The motion planner generates complete on-line 
plans to carry out these directives, specifying both sin- 
gle and dual-armed motion and manipulation. Combined 
with the robot control and real-time vision, the system 
is capable of performing object acquisition from a moving 
conveyor belt as well as reacting to environmental changes 
on-line. 



Figure 5: Application Example: Task-Level control 
of a Two- Armed robot. 

This example shows the main application modules. Each 
module communicates using one or more of the three in- 
terfaces: The World Model interface (WMif), the Robot 
Interface (Rif) and the Task Interface (Tif). These mod- 
ules are physically distributed. All the interfaces are built 
using the Network Data Delivery Service (NDDS). NDDS 
plays the role of a bus providing the necessary module in- 
terconnections. 

The use of NDDS as the underlying communication 
mechanism provides unique benefits to this application 
without requiring any special programming. 

• The different modules can be distributed across dif- 


ferent computer systems (with different processor ar- 
chitectures and operating systems)^. 

• Several copies of the same module can be run con- 
currently. For example, the graphical user interface 
module can be run on several workstations. This al- 
lows multiple users to monitor the system and per- 
mits simultaneous interaction with the robot system. 

• The graphical simulator module can mascarade as the 
real robot and allow independent testing of the re- 
maining modules in the system. Any time the real 
robot goes on-line, its productions override those of 
the simulator^ and all the remaining modules are now 
connected to the real robot. 

• Should the planner be unable to generate adequate 
plans for specific situations due to limitations or mal- 
functions, a teleoperation module (under develop- 
ment) can override planner commands and take con- 
trol of the robot. 

• Future modules (such as the teleoperation-module 
mentioned above) can be dynamically added to the 
system even if it is already in operation or deployed^. 

6 Conclusions 

This paper has presented NDDS, a unique data-sharing 
scheme that allows programs distributed on a computer 
network to share data and event information unaware of 
the location of their peers. These facilities provide fun- 
damental new capabilities to distibuted control systems 
that use NDDS as the underlying communication mecha- 
nism. This paper has also discussed an application that 
uses NDDS to communicate between different modules 
that integrate planning into a two-armed robotic work- 
cell. Several other applications are cited in the paper. 
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ABSTRACT 

Development and use of robotic systems are 
proposed to reduce the excessive human labor 
requirements associated with growing, processing, 
and recycling plants grown in controlled 
environments for agriculture and long duration 
human space missions. This paper presents a way 
of assessi ng requ i rementsforintelligence-based 
robotics in an Advanced Life Support System (ALSS) 
involving both physicochemical and 
bioregenerative technologies. 

In support of the application of system 
engineering principles to ALSS, leading to 
requirements assessment of robotic equipment, an 
approach is proposed which has four distinct 
components: (1) delineate the story of the mission 
in terms of processes with attendant types of 
resources needed, including options for use of 
robotic systems, (2) provide dynamic mathematical 
models of the biophysical processes involved, (3) 
provide 3-D animated graphical models of physical 
materials handling processes, and (4) provide 
systems dynamics models which, when used in 
simulation, reveal the implications for selected 
resource allocation schemes in terms of resources 
required to complete operational tasks. The 
simulations not only help establish ALSS design 
criteria such as production procedures and volume 
necessary for each crop, but they also may offer 
guidance to ALSS research efforts by identifying 
gaps in knowledge about procedures and/or 
biophysical processes. 
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1 . Introduction to Advanced Life Support 
Systems (ALSS) 

NASA has studied controlled ecological life 
support systems (CELSS's) (Averner, 1990), or 
ALSS's, which are bioregenerative and based on a 
combination of biological and physicochemical 
processes si nee 1978. These systems may be used 
on future human space missions in low-Earth orbit, 
in transit to other planets, and on lunar and 
planetary surfaces. The purpose of an ALSS is to 
provide food and to replenish supplies of oxygen 
and water for long-term voyages in space. The 
basic physiological processes of plant growth 
(photosynthesis, nutrient uptake, translocation, 
transpiration, and respiration) combine carbon 
dioxide, minerals, and water to form food and 
inedible plant materials which must be recycled to 
supply nutrients for subsequent crops. Photosyn- 
thesis removes carbon dioxide exhaled by the 
astronauts and supplies oxygen for breathing. 
Transpiration adds odorless moisture to the air, 
where the water can be condensed and purified 
for consumption. Extensive work on the ALSS has 
been done by NASA Ames Research Center, 
Kennedy Space Center, Marshall Space Flight 
Center, and Johnson Space Center (Ming and 
Henninger, 1989). 

The successful culture of higher-order plants 
requires the careful management of complex 
biological processes. An ALSS must have seeds (or 
other propagules), a source of mineral nutrients, 
water, a plant growth medium, and radiant energy 
(light). Labor and machinery are required to move 
materials from one process to another. It is 
necessary to monitor the state of the ALSS and 
diagnose and correct identified problems. Artificial 
intelligence offers supervised intelligent systems 
for monitoring and control of process operations 
and various powerful approaches to inspection, 
test, and diagnosis of problems, such as model- 
based diagnosis, as well as recovery from problem 
episodes. 
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Food production alone can be very labor 
intensive unless assisted by automated devices. 
During the 1920’s, farmers without tractors, 
combines, and other machinery were able to 
produce enough food to feed themselves and one 
other person. Now, American farmers, fully 
mechanized with tractors, combines, and other 
machines, are capable of growing not only their 
own food, but enough for 50 others! When food 
processing, supply, and marketing efforts are 
included, about 1 in 5 Americans work in an 
agriculturally related business. Russian experi- 
ments conducted in BIOS 3 required an average of 
4 hours per day per crew member to produce, 
process, recycle, and prepare food (Schwartzkopf, 
1991). Biosphere 2 participants spent from 2 to 
3 hours per day each operating their food 
production system. 

In the ALSS, the processes are not restricted to 
production and processing, but must also include 
efforts to recycle inedible portions of plants into 
nutrients which can be utilized by succeeding 
crops. An ALSS for space exploration must be 
closed-loop. Nothing can be wasted nor allowed 
to accumulate in an unusable form. It is not 
sufficient to collect the garbage and bury it in 
landfills. 

Biomass processing is an essential process 
which cannot be forgotten. It will consume labor, 
utilize resources, and have by-products which must 
be recycled to recover nutrients. 

When one considers all of these activities 
required to keep the ALSS functioning, one will 
not be surprised when simulations indicate that 
robotic systems will be needed to avoid crew 
overload or even to complete the mission. 

JSC is developing a Human Rated Test Facility 
(HRTF) (Henningerand Tri, 1993) in which to 
conduct year-long, high-fidelity tests of the ALSS, 
complete with four 90-day stays by crews of four. 

In order to gain the most from these experiments, 
a structured process of systems engineering (Petri, 

1993) for the whole HRTF mission is being pursued 
within an integrated computer aided concurrent 
engineering environment (Erickson and Lawler, 

1994) , soon to be accessible via Internet. We want 
to design an HRTF that optimally considers the 
roles of humans, automation, and robotics. JSC 
will adapt and integrate, and develop where 
needed, automation and robotics technologies for 
applicability to the HRTF. JSC intends to carry out 
some of the robotic systems development through 
industry and university partnerships. As a key part 


of this systems engineering process, models and 
simulations are being developed. The models and 
simulations will be used to evaluate alternative 
processes, determine an optimum design of the 
HRTF, and to determine the most critical data to 
obtain during the human rated tests. The results 
of these experi ments wi 1 1 be used to i m prove the 
models. Thus, the models and simulations will 
become a vehicle for structuring information and 
disseminating knowledge to peer scientists and 
engineers at other institutions. 

The development and demonstration of a 
successful closed-loop ALSS is sure to have impli- 
cations on how practically every community in 
America handles its "waste" and recyclables. 

Being able to disseminate this knowledge is just as 
important as discovering new information. Thus, 
models and simulations are key methodologies in 
ALSS research efforts. 


2. Models and Simulations 

Models are mathematical equations 
(including computer codes) which describe 
biological, physical, chemical, and operational 
processes. Lord Kelvin once said: "If you cannot 
model it, you don't understand it." The most useful 
models for design purposes are based on cause and 
effect relationships, not statistical correlations. 
Simulations are the process of utilizing the models 
to predict responses to changes in input conditions 
and/or equation parameters. 


3. Biophysical Process Modeling 

The biophysical processes in an ALSS not only 
have an impact on the design of the HRTF, but also 
influence the design of automation and robotics 
used as part of the HRTF. This isso for robotic 
systems because the crop production (volume of 
crop per unit area times area) directly impacts how 
much needs to be moved at what times. The rates 
of production determine the times for certain 
tasks. 

The complexity of the biological, physical, 
and chemical processes in an ALSS requires mathe- 
matical (computer) models be developed for each 
candidate crop and each candidate process. At 
least as many as 10 different crops will be required 
to provide a balanced, nutritive diet for the 
astronauts. To meet operational constraints (such 
as carbon dioxide levels and peak energy 
demands), multiple stages of the same crop will be 
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likely. Photosynthesis, translocation of 
carbohydrates, transpiration, nutrient uptake and 
translocation, and respiration are a few of the 
processes which must be expressed as mathe- 
matical relationships between the environment 
(temperature, humidity, light intensity, etc.) and 
crop physiology (plant type, light interception, 
carbohydrate levels, etc.). By writing these 
expressions as cause and effect equations, the 
models can be used to accurately predict ALSS 
responses and to determine the optimum design. 
Because communities (or populations) of leaves, 
stems, roots, etc., are involved, the equations 
cannot be at a molecular detail, but must predict 
the overall population response. Just imagine the 
complexity of attempting to predict temperature 
(T), pressure (P), and volume (V) relationships in a 
gas by modeling molecular motions and inte- 
grating the impacts on vessel sidewalls. Since a 
much less detailed model (perfect gas law) has 
been proved to be valid, it would be a better 
choice to model the relationship between P, V 
and T. When developing models of systems, the 
correct level of detail (or hierarchical level) 
depends on the objectives or use of the model . 

The lowest level of detail for modeling will 
correspond to properties of leaves, stems, and 
roots in any given plant population. This tends to 
correspond to the objectives of using the model in 
the ALSS studies. (Of course, monitoring for the 
presence of various types of microbial activity takes 
us orders of magnitude higher in spatial 
granularity.) 

At least two levels of modeling would be 
beneficial. At one level, the objective would be 
rather basic, fundamental questions concerning 
processes and rates of material transfer or 
conversion. Examples would include models which 
permit simulation of nutrient transfer from the 
transport/support media, which could be hydro- 
ponics or simulated lunar soil, ora manufactured 
soil mix based on degradable plant material. 
Simulation of the nutrient transfer rates would 
permit alternative designs of plant containers to 
be compared and evaluated. At the second level, 
the entire population of plants is considered, and 
questions such as percentage of light intercepted 
and rates of photosynthesis, respiration, leaf 
expansion, and translocation are answered on a 
canopy basis, not for individual plants. 


4. Mission Simulations 

One approach to analyzing an ALSS is to 
model all the tasks and resources required to meet 


prescribed performance specifications. Tasks 
include operations such as planting, transplanting, 
spacing of pots, gathering, threshing, cleaning, 
milling, baking, grinding, etc. Resources are labor, 
facilities, equipment, and supplies. For FY94, one 
objective is to build an initial mission simulation 
which can be upgraded when more information is 
available and which can be used in a short cycle 
run-break-fix approach to establish the feasibility 
and requirements of automation and robotics 
components of an ALSS. The software is based on 
a Mission Simulation and Analysis Tool (MSAT) 
(Erickson, Aucoin, and Dragg, 1992). These 
simulations will provide information on event 
timing, utilization of resources, and efficiency of 
rules which allocate scarce resources. The MSAT is 
unique, specialized software which permits close 
scrutiny of tasks proposed for a mission. Scenarios 
are encoded as parallel, interactive processes, and 
the outcome is reported as a sequence of daily 
reports of accomplishments and tasks not done for 
lack of critical resources. The simulator accurately 
accounts for the nonlinear effects of parallel, 
interactive processes, including contingencies and 
constraints, such as mission rules. MSAT keeps 
track of resource utilization and user-defined 
resource capabilities (equipment designs), and it 
won't allow simultaneous use of the same resource 
in different places. From the methodological point 
of view, MSAT is a model-based simulator which 
uses a representational framework with objects, 
processes, events, relations, and states of affairs as 
basic categories of information. This contrasts 
with other representational schemes which use 
fewer primitive types. A user has great flexibility in 
modeling mission processes; providing for options, 
including hazard conditions; and guiding the 
allocation of different types of resources to 
different types of processes (e.g., support, logistics, 
crop handling, maintenance, and so on). 

In 1994, the MSAT simulation of the HRTF will 
primarily focus on the crop planting, growth, 
harvesting, and processing tasks performed by 
humans or those proposed for automation by 
robots. Times and options for performing all tasks 
will be estimated. Observation tables will be 
developed to describe day-by-day tasks that are to 
be accomplished during planting, growing, 
harvesting, processing and recycling of crops, and 
recycling of the growth medium. For each task, 
possible initial conditions will be established. For 
planting, initial conditions might be: 

• Seed loose in a container, 

• Seed tapes, or 

• Seedlings in cell-flats. 
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Each step in the succeeding process will be 
identified and characteristics such as length of 
time, labor, growth area, and storage required for 
each step will be estimated during this early 
attemptto develop a model. Once the biophysical 
process models have been completed, their results 
will be dynamically linked to the operations 
knowledge base so that as estimates of resource 
requirements and ALSS productivity improve with 
additional research results, the design criteria for 
the mission simulators will be updated. Similar 
process descriptions are required for each 
candidate crop. 

In addition to determining the amount of 
resources required for each mission scenario, 
simulations with these operations models wi II 
determine the amount of materials handling 
necessary to produce, harvest, and process edible 
and inedible biomass and recycle the plant growth 
medium for several alternative configurations of 
equipment and the HRTF layout. Several methods 
of handling materials will be proposed and 
evaluated to determine the amount of human 
and/or robotic labor required for each procedural 
step. Simulations will be start to finish, that is, from 
seed (or plant propagules) through all the steps 
necessary to obtain food and provide seed as initial 
conditions for the next crop. The model will 
include transport of materials from one area of the 
HRTF to another. Examples include spacing of 
pots, and subsequent return of unused container- 
plants, and carrying of plant materials from the 
growth area to the processing and storage areas. 

Once the models are completed, mission 
scenarios will be simulated to determine which 
proposed processes lead to the best system 
performances. Indices of ALSS performance will 
include: 

1. Number of astronauts supported, 

2. Manual labor requirements, 

3. Energy consumption, 

4. Minimum reserves for life support elements, 

5. Flexibility, 

6. Adaptability, 

7. Complexity, 

8. Reliability, 

9. Storage volume, 

10. Maintainability, 

11. Survivability, 


12. Launch mass, and 

13. Launch volume. 

Obviously, the number of people an ALSS can 
support, the manual labor requirements, and 
energy consumption are important characteristics. 
Some designs will have more reserve capacity than 
others, and higher marks should be given to those 
with the larger reserves of food and other life 
support elements. The ability to grow more of 
some crops and less of others (flexibility) is 
important, as astronaut's taste preferences may 
change overtime. The ability to quickly adapt to 
changes in mission scenarios due to unforeseen 
events is an important feature of an ALSS and of 
theMSAT models and simulation. Simplicity 
usually meansa lower probability of failure and 
larger chance of completing the mission 
successfully. Simplicity also affects the mainte- 
nance and/or repair of machinery used to 
automate materials handling in an ALSS. Storage 
volume may be viewed as positive or negative. In 
some cases, large storage volumes may be linked to 
large reserve stores. In other cases, the large 
storage volume may be required because some 
components require a long time to recycle. Thus, 
the index of merit may be the total volume 
required per person, although that is obviously 
nonlinear and dependent on the number of 
astronauts supported. The ability of the ALSS to 
maintain itsfunctionality in a survival mode, 
without crew, is an especially important feature. 
Finally, one cannot ignore the costs of getting an 
ALSS into space; thus, launch criteria such as mass 
and volume cannot be ignored. 

Mission scenarios which are infeasible for all 
modeled processes will be identified as high 
priority areas for further experimental research to 
discover more efficient and effective techniques. 


5. Animated Graphics Simulation 

Once the "what to do" steps have been 
defined by the operations simulations, the 
question of "how-to-do-it" remains. This will be 
answered by developing 3-D graphical models of 
the HRTF and by simulating the materials handling 
as performed by people and mechanisms, 
including robots. Unlike the operations simula- 
tions, these simulations will produce realistic views 
of the processes and permit users to ascertain that 
space is available to perform the tasks and to store 
the machinery when not in use. 
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These models will include grippers and 
sensors which enable a robot to perceive changes 
in the environment and adapt to them. Collisions 
between the mechanisms and the HRTF structure 
will be identified by the simulations, and 
additional code will be added to avoid the 
problem with the real-world robots. Software 
tools available in JSC's Integrated Graphics 
Operations Analysis Laboratory (IGOAL) will be 
used to develop the 3-D graphical models and 
perform simulations of the materials handling 
processes which will include seeding, trans- 
planting, transport of containers, monitoring of 
crop health, spacing of containers, gathering, 
separation of edible biomass, processing, storage, 
and food processing. 

Results will be automatically entered into the 
ALSS knowledge base and compared to infor- 
mation from the operations and biophysical 
simulations to determine which combination of 
crops, production, and materials handling 
technologies work best as a complete life support 
system. The data from the graphical simulations 
will include reach and clearances, cinemas from 
multiple viewpoints (including some from 
potential machine vision camera positions and 
robot vision viewpoints), and kinematic analysis of 
manipulator actions, including timing of move- 
ments. These data are necessary to design the 
servos needed to move the materials in the 
allocated time, to determine the number of robots 
necessary, and to determine what each robot 
should do. In some cases, robots might be 
programmed to perform similar tasks in parallel, 
but in other cases, it could be preferable to have a 
robot perform a different task in series with other 
machines. These simulations will also help 
determine what tasks should be automated by 
dedicated, hard-engineered mechanisms and 
which should be performed by flexible, program- 
mable machines (robots). For example, should a 
robot harvest wheat by reaching and removing a 
few grain heads? Or, should it harvest an entire 
container of plants and place the accumulated 
heads in a mechanism which threshes and 
separates the kernels from the straw? Is the 
advantage of having only the head to thresh 
outweighed by the difficulty of locating and 
removing each head of wheat? Is the advantage of 
working with known processes, such as threshing, 
outweighed by the costs associated with the 
limited-purpose device? Should there be a mini- 
combine or separate harvester and thresher, and 
should the plants be moved to the harvester or the 
harvester moved to the plants? Until the 


simulations are performed and the results 
quantified, one guess is as good as another. 


6. Controlled Environment Agriculture 

In controlled environment agriculture, which 
is a several-billion-dollar-per-year business in the 
U.S., intelligent robotics are needed (Simonton, 
1990 Trans.) to keep prices competitive. Market 
forces are compelling greenhouse operations, 
which are labor-intensive, to automate. A major 
motivation is for U.S. producers to improve 
productivity to deal with global competition 
(Carney, 1988). 

The greenhouse production of bedding 
plants and the cell/tissue culture industry could 
readily benefit from the development of systems 
engineering tools and robotics for plant materials 
handling. Tissue cultured plants typically cost 
$0-50 to $1, which limits the market to high value, 
horticultural crops such as ornamental flowers. If, 
through automation, the plants could be produced 
for $0.05 or less, the market would swell to include 
many field crops, such as strawberries. This would 
permit plants with genetically engineered 
characteristics, such as plant-host resistance to 
pests, to be grown commercially without the need 
for chemical pesticides. Even though many such 
genetically engineered plants exist, there is no 
current method of propagating the technology at 
an affordable price. 

Even celt/tissue culture research would 
benefit from robotics. Imagine what would 
happen if a lab technician who normally performs 
40,000 operations every six months in order to find 
a single plant with potential to be genetically 
superior has his/her work assisted by robots which 
perform 40,000 operations every day. That has the 
potential of accelerating research discoveries by 
150 or more. What might have taken 10 years to 
discover would then take a month or so. 

There has been extensive research and 
development of robotics for planting, seeding, 
transplanting, cutting and grafting, tray stacking 
and destacking, and other robotic applications in 
controlled environment agriculture conducted in 
the U.S., Canada, Great Britain, Netherlands, 
Germany, France, Israel, Japan, China, and Taiwan. 
Many of these are documented in three recent 
international symposia (Kurata and Kozai, 1992; 
American Society of Agricultural Engineers, 1991; 
and Kozai, 1988). 
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7. Field Applications of Agricultural Robots 

From the cell/tissue culture lab, through the 
greenhouse, and to the field, the world awaits the 
development of affordable, reliable robotic 
machinery for handling plant materials. Because 
greenhouse growers have discovered that plants 
require different rooting and canopy environ- 
ments, bedding plants are grown in different size 
and shape containers. Thus, any mechanism 
designed for transplanting must be flexible and 
programmable to accommodate these differences 
in size and shape of containers. 

Robotics are an obvious choice for 
automating these delicate, complex materials 
handling problems. Commercial efforts are limited 
to transplanting of bedding plants (Beam, etal., 
1991). University research efforts span a wider 
effort. Purdue University (Miles and Kutz, 1991; 
Kutz, etal., 1987; Kutz and Miles, 1986; and Kutz, 
etal., 1986), Rutgers University (Ling, et al., 1990; 
Ting, etal., 1990; and Ting, et al., 1988), and the 
University of Georgia (Si monton, 1990) have 
studied robotic transplanting and plant materials 
handling. Robotic harvesting of fruit has been 
studied in France (Grand d'Esnon, etal., 1987), in 
Japan (Kawamura, etal., 1986), and in Florida 
(Harrell, 1987; and Slaughter and Harrell, 1987). A 
joint research effort between Purdue University 
(Indiana), the Volcani Institute (Israel), and the 
Weizmann Institute (Israel) has developed a robot 
capable of selectively harvesting muskmelons 
(Edanand Miles, 1993; Benady and Miles, 1992; 
Benady, etal., 1992; Edanand Miles, 1992; and 
Edan, et al., 1992). In this project, fruit location is 
determined by a machine vision and image 
processing system in association with a laser plane 
projector which casts a beam on the melons to 
provide 3-D information on position. Color and 
aromatic volatile gases released during the 
ripening process are used to determine which 
fruits are ripe enough to harvest. Clam shell, ring 
type grippers grasp each fruit and carry it to a 
conveyor to be packaged. Field testing of the 
prototype in Israel by the Volcani Institute has 
been successful, but much work remains to make 
the robot flexible enough to harvest other crops 
and to perform other functions such as 
transplanting in an economically-viable fashion. 

More than any other single factor, the high 
cost of robotic components prevents more 
widespread use of agricultural robotics. The lack 
of research funding to develop affordable sensors 
and servo-mechanisms for agricultural applications 
contributes to this malaise. NASA's funding of 


robotic materials handling research for an ALSS 
would certainly spin-off robotic technologies for 
agriculture. 

The agricultural arena provides exceptional 
challenges in dealing with the mixes of mobility 
and manipulation considerations which need to be 
taken into account in creating and designing the 
next generation of robotic equipment. 


8. Summary and Conclusions 

The development of models and their use to 
simulate the performances of an ALSS would help 
focus experimental research efforts, provide data 
essential for design of a HRTF, and assist the spin- 
off of robotic materials handling technologies to 
agriculture. Simulations of the JSC HRTF will 
include biophysical processes, operations, and 
cinemasof plant materials handling by humans 
and robotic machi nery. Data from models of each 
crop will be structured and dynamically linked to 
shared knowledge bases. 

The development of robots for plant 
materials handling in an ALSS will have immediate 
applications in the bedding plant and cell/tissue 
culture industries. Spin-offs to agriculture would 
be numerous and highly beneficial. 
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Abstract 

The paper describes the concept and design 
considerations of a robotic hauling truck system intended 
for use in surface mine applications. An engineering 
prototype of the designed system will be implemented at 
the Syncrude oil sands mine near Fort McMurray in 
northern Alberta. The robotic truck will perform routine 
hauling functions through a combination of manual, 
teleoperated and autonomous activities. The primary 
objective for the design is to provide an integrated 
intelligent vehicle system that is capable of navigating 
autonomously across outdoor terrain avoiding collisions 
with obstacles that could be encountered. This capability 
will enable the robotic hauling trucks at Syncrude to 
operate autonomously on the routes between the 
loading area and dumping area. Operations at the 
loading and dumping areas will be performed using 
manual or teleoperated control. Future developments will 
allow autonomous operation of the robotic truck at the 
load and dump areas. 


1. Introduction 

The semi-autonomous robotic truck system project is 
being pursued by a research team consisting of 
members from Syncrude Canada Ltd., Defence 
Research Establishment Suffield and the Alberta 
Research Council. 

Syncrude Canada Ltd. (Syncrude) operates a world 
scale oil sands mining, extraction and upgrading 
operation near Ft. McMurray, in northern Alberta. In 
1992, about 50 million cubic metres of overburden were 
removed to expose the oil sands ore. Currently thirty 170 
ton hauling trucks are used 24 hours per day to remove 
overburden. Within four years it is estimated that fifty 
hauling trucks will be necessary, with the balance of new 
trucks having a 240 ton capacity. Syncrude is 
considering automating the operations of its hauling 
truck fleet in order to reduce the mine operating costs. 


Defence Research Establishment Suffield (DRES) has a 
well established robotic vehicle development program. 
Land, air, and sea based vehicles have been 
successfully developed for applications including live fire 
target transport, surveillance, hazardous material 
handling and mine detection. An important result from 
this work is the development of ANC/EUS, an advanced 
and proven generic remote vehicle control system. 
ANC/EUS is an inexpensive semi-autonomous vehicle 
control technology able to interface with virtually any 
surface vehicle as well as with their payloads. 

The Alberta Research Council (ARC) is the oldest and 
largest provincial research organization in Canada. The 
ARC is experienced in leading and participating in 
technology development and technology transfer 
projects in many fields including autonomous systems, 
robotics and advanced computer applications. 

The intent of the combined research team is to transfer 
the military robot vehicle technology to a prototype 
robotic truck application in Syncrude's mine. The 
following sections of the paper describe the application 
in detail, the technical challenges and the concept for the 
prototype vehicle. The paper concludes with comments 
about the status of the development effort. 


2. Current mine operation overview 

Syncrude operates a large surface mining plant in a 
region of northern Alberta called the Athabasca oil sand 
deposits. These deposits are one of the largest oil 
reserves in the world, with almost 1000 billion barrels of 
oil, over four times the amount in Saudi Arabia. Of the 
Athabasca reserves, about 10% are recoverable by 
surface mining techniques, and over half could be 
recovered by in-situ methods. 

In the Syncrude area, the oil sands average about 42 
metres thick, and contain an average of 11% oil by 
weight. Syncrude mines and processes oil sands 
containing over 6% oil, with any lower grade material 
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being treated as waste. The oil sands are covered by 
varying amounts of overburden. At Syncrude the 
average overburden depth is about 20 metres. The mine 
location was chosen to minimize overburden removal 
required during the initial years of mining. 

2.1. Oil sands surface mine operation 

The first stage in the Syncrude operation is to remove 
trees, muskeg and any reclaimable soil material. This 
material is stored for later reclamation of the mined out 
area. The overburden is then stripped using a truck and 
shovel fleet. 

The underlying oil sands material Is mined in two stages, 
the first using four large draglines with 65 cubic metre 
buckets which excavate the oil sands and place the ore 
in linear piles called windrows. The draglines also reject 
waste bands in the oil sand, and dump this material back 
into the mined-out pit. In the second mining stage, the 
windrows are recovered by four bucket wheel reclaimers 
that place the oil sands onto conveyors for transportation 
into the crude oil producing plant. There are four trains 
of conveyors, with belts 1.8 metres wide, and total 
conveyor lengths of up to 7 kilometres. The individual 
conveyor flights are up to 2.5 kilometres long. In the oil 
producing plant, the oil sands material is appropriately 
processed to produce synthetic crude oil, that is finally 
pipelined to markets. 

2.2. Hauling operations 

Syncrude currently uses its 170 ton hauling truck fleet to 
transport overburden covering oil sands. The trucks are 
loaded using mining shovels, either with waste 
overburden or with oil sand. The trucks conveying 


overburden travel to a waste dump area, most of which 
are located in the mined out pit. The trucks dump the 
overburden and then return to the loading area to repeat 
the cycle. 

The trucks carrying oil sand travel to an Auxiliary 
Production System (APS) crusher/feeder. The oil sand is 
dumped in a hopper, crushed and fed onto a conveyor 
system for transportation into the plant. Again, the truck 
returns to the loading area to pick up additional loads. 
Figure 1 illustrates the major functions of the hauling 
truck operation. 

• The loading area is at the shovel or loader. The 
location of, and routing to this area changes 
frequently. 

• The main loaded haul, between the loading area and 
the dump (or crusher). Normally, this is a relatively 
long section, and the routing will not change for 
particular load point and dump. 

• The return empty haul is often, but not always, the 
same as the loaded haul route. 

• The overburden dump area is where the waste 
material is dumped. The location and routing across 
this area changes frequently. 

• The APS crusher or feeder-breaker is used for oil 
sand feed. This location is fixed. 

• Special operations, such as fueling, maintenance, 
extraction rejects, moving to a new location or 
operation, and other non-routine tasks. 



Empty Haul 

Figure 1 • Hauling Truck Operation 


The degree of automation that can practically be applied 
to the different hauling operations will vary. The fully 
automated mode is easiest to implement on the loaded 
and return hauls. Initially, manual or tele-operated 
modes will be used at the shovels and dump areas. The 
overburden dump area Is the hardest to automate 
because its location changes quite frequently. Special 
operations, such as driving into maintenance bays, 
would be carried out manually. 


2.3. incentives for automation 

There are a number of incentives to automate the 
trucking operation: 

• Productivity Increases. The use of robotic trucks 
could lead to higher truck utilization and thus higher 
productivity, as there would be less downtime due to 
operator breaks and no delays at shift changes. 
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• Maintenance Costs. Robotic trucks should drive 
more consistently, giving lower maintenance costs, 
and also higher availability. 

• Reduction in mine operating costs. There is a 
potential in a robotic truck mining scheme to reduce 
the number of operators, and thus save payroll costs. 


3. Robotic Truck System concept 

Automating hauling truck fleet operations Is based on the 
development of a semi-autonomous robotic vehicle that 
could be operated remotely. Remote operation of a 
driverless truck will require integration of several 
technologies on board the vehicle. These technologies 
include communication, control, guidance, obstacle 


detection, collision avoidance and condition monitoring. 
The concept of a semi-autonomous robotic truck is 
illustrated in Fig.2 

The robotic truck system could have three modes of 
operation: manual, teleoperated, and fully autonomous. 
In the manual mode of operation a driver has full control 
over the vehicle and performs all functions as if no 
automation were implemented. When a vehicle is 
teleoperated, there is no driver in the cab, and the truck 
is driven and controlled remotely from a control station 
using video monitors and input devices to control speed, 
steering and other functions. In the autonomous mode of 
operation, the vehicle drives independently using a 
navigation system for setting route to the destination as 
well as an obstacle detection and collision avoidance 
system. 



Figure 2 - Robotic Truck System Concept 


Initially, it is expected that the robotic truck will be 
operated in manual or teleoperated mode at the load 
(shovel or loader) and dump (dump or crusher) areas. 
The robotic truck will operate autonomously on the 
loaded haul from the load area to the dump area, and on 
the empty return haul from the dump area back to the 
load area. Special operations (fueling, maintenance, 
etc.) will continue to be carried out manually. 

A fully loaded truck may weigh as much as 500 tons and 
be capable of traveling at 35 mph. Collisions either with 
obstacles such as big rocks or with other vehicles would 
have to be avoided. All authorized vehicles in the area 


of truck operation will be fitted with transponders, and all 
automated truck haul roads will be closed to normal 
traffic. However, the robotic truck must also be able to 
avoid collisions with unauthorized equipment, personnel, 
large rocks and animals. This issue is discussed in more 
detail in Section 3.4. 

The major technologies needed for the development of a 
semiautonomous robotic truck system are at different 
stages of maturity. The current status of these 
technologies is summarized in Table 1. A detailed 
discussion of enabling technologies follows the table. 
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TECHNOLOGY 

DESCRIPTION 

CURRENT STATUS 

Communication 

Communication between trucks 
and the central office, etc. 

Systems are available. 

Control 

Overall control of the truck. 

Computer control systems are 
available. Software development is 
needed. 

Guidance 

Guidance and location of trucks 
along a road. 

A number of techniques have been 
developed, but further work is 
needed. 

Obstacle Detection and Collision 
Avoidance 

Avoidance of objects, people, etc. 
in the path of the truck. 

Some sensors are available, but 
significant work is needed. 

Condition Monitoring 

Monitoring of truck health and 
operation. 

Systems available, further 
development is needed 


Table 1 Status of technologies needed for robotic trucks 


3.1. Communications 

A reliable means of communication between the truck 
and the location of the operator station is needed. Data, 
control signals, and video images from the truck all need 
to be transmitted. Fail-safe communications systems are 
also important given the potential for communication 
links to be compromised or broken in an outdoor 
environment DRES has addressed all of these issues 
with the ANC/EUS robot vehicle control system using 
conventional radio communication systems. 

3.2. Control 

Automatic control of the robotic truck is essential. There 
will need to be features to permit a driver to over-ride 
automated systems and manually drive the truck when 
needed. The automatic control functions will need to 
accommodate teleoperation of the truck when required. 
The ANC/EUS system addresses most of these control 
requirements. 

3.3. Guidance 

When the semiautonomous truck is moving under 
automatic control, some form of guidance system is 
essential. Possible systems include: 

1. Guide Wires. 

2. Global Positioning System. 

3. Microwave Location. 

4. Lasers. 

5. Inertial Systems. 

6. Dead Reckoning. 

Each system has its advantages and disadvantages. 
DRES has successfully incorporated Global Positioning 
System technologies into the ANC/EUS control system. 
Other guidance systems are commercially available. In 
mining applications more than one guidance system will 
be implemented to provide system redundancy for 
failsafing. 

3.4. Obstacle detection and collision avoidance 

Detecting obstacles in the way of the truck and avoiding 
collisions are critical capabilities for the robotic truck. A 
driverless truck cannot be considered safe to operate 


unless there Is a rugged, reliable and fail-safe method for 
obstacle detection and collision avoidance. The types of 
obstacles that need to be considered include: 

• Other automated haul trucks 

• Authorized shovels, bulldozers, graders, 
pickup trucks etc. 

• Authorized people 

• Unauthorized people, animals, vehicles 

• Rocks on the road dropped by other trucks 

• Potholes In the roadway 

The primary sensing techniques that could be used for 
obstacle detection include: 

1. Infra-red sensors. 

2. Ultra-sonic sensors. 

3. Optical sensors. 

4. Microwave sensors 

5. Laser scanners. 

6. Computer Vision systems 

No one sensing technique will effectively detect all the 
obstacles that need to be considered. Combinations of 
these sensors, each detecting a range of different types 
of obstacles, will likely be required to handle all the 
anticipated circumstances. 

These primary sensing systems would be directly 
determining the presence of obstacles in the path of the 
truck. These systems would be backed up by other 
systems such as: 

7. Mechanical bumpers mounted around the 
truck. 

8. Security approaches to prevent unauthorized 
entry of vehicles and people into the truck's 
zone of operation. 

9. Emergency stop 'buttons' located at strategic 
places around the truck’s zone of operation. 

10. Transponders mounted on authorized vehicles 
and personnel to provide automatic 
identification to the truck. 

11. Video images from the truck to the remote 
operator. 

3.5. Condition monitoring 

In a manually driven truck, the operator will often notice if 
the truck is operating abnormally by listening to the 
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engine, feeling any vibrations, or iooking at the gauges. 
In an automated truck, a condition monitoring system is 
needed to detect problems in truck operation before any 
serious damage to the truck occurs. 

Capabilities are required for detection of incipient 
problems with minimai faise aiarms. The information will 
need to be communicated to the location of the operator 
for processing or for maintenance cali-out. Condition 
monitoring systems are now available, but further 
development will be needed to create a system that is 
able to detect and identify subtle changes in truck 
operation. 


4. System design 

The prototype robotic hauling truck system will consist of 
two major elements: truck control station and on-board 
equipment mounted on each truck in the fleet. The 
overall system is illustrated in Fig.3. The design of the 
prototype system is derived from previous 
implementations of the ANC/EUS robotic vehicle control 
technology developed by Defence Research 
Establishment Suffield. 
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Figure 3 - Robotic Truck System 


4.1. On-board subsystems 

The modular nature of ANC/EUS (Fig. 4) will allow the 
control system to be efficiently installed on the robotic 
truck. Only one module needs to be redesigned to suit 
the control inputs and outputs of the truck. This is the only 
major piece of development to create the basic vehicle 
control system. Other ANC/EUS modules in the basic 
vehicle electronics package (i.e. excluding obstacle 
detection and collision avoidance and health monitoring) 
remain the same and do not require redesign. 

All modules are interconnected with a high speed fibre 
optic data link that provides effective noise immunity. The 


modules communicate by passing high level commands 
(which remain constant for all vehicle types). 

The vehicle electronics package consists of four primary 
modules: the vehicle control processor, the vehicle 
personality module, the video control module and the 
vehicle navigation module. 

The vehicle control processor module functions as a 
network gateway between the control data link and the 
vehicle fibre optic data link. High level functions of the 
vehicle, such as semi-autonomous navigation, health 
monitoring, obstacle detection and collision avoidance 
would also reside in this module. 




The vehicle control processor is housed in a rugged 
temperature-controlled weather-proof enclosure. The 
main processor (currently a 68030 single board 
computer) is plugged into the 12-slot VME back plane. 
As the system develops over time, more processors 
and/or special hardware, such as vision processors, can 
be added. 

A novel approach to temperature control of the enclosure 
allows commercial grade components to function where 
industrial or military grade components were previously 
required. The temperature is maintained by using 
thermo-electric heat pumps, which can either heat or 
cool the enclosure. 

Software for the vehicle control processor is written in C 
and speed-critical routines have been written in 
Assembler. The code can reside in Erasable 
Programmable Read Only Memory (EPROM) or can be 
loaded from disk. 

The vehicle personality module controls all vehicle- 
related activities on the robotic truck such as starting and 
stopping the engine, steering, speed control, and 
braking. The personality module also monitors vehicle 
information such as engine temperature and revolutions 
per minute, charging current and voltage for the primary 


battery and other useful vehicle parameters. 

The video module allows up to 256 video and sound 
inputs to be switched to up to four different RF video 
transmitters and/or fibre optic video channels. As well as 
switching video and sound, each channel can have an 
associated pan, tilt, zoom and flood lamp control 
channel. 

All functions associated with determining the truck's 
position reside in the navigation module. Any module 
requiring navigation data can request it from this module. 
The current ANC/EUS system uses a Trimble GPS 
receiver to obtain global positioning data as well as pitch 
and roll sensors to alert the vehicle to dangerous 
attitudes. The receiver can also make use of differential 
GPS data to increase its accuracy. The specific 
navigation technology to be used for the truck has yet to 
be determined. 

The application modules connect to the vehicle fibre 
optic data link in the same manner as the four primary 
modules. Application modules are required to implement 
any functions not included In the primary four modules. A 
typical function for an application module may be the 
control of a robotic arm. Currently, no application 
modules are projected for the prototype robotic truck. 
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Figure 4 • On-board subsystems 


4.2. Control station 

The control station (Fig. 5) for the robotic truck is modular 
in nature, consistent with the design philosophy of the 
ANC/EUS system. The design allows different control 
consoles to be connected as well as efficient interfacing 
for a given application. It is anticipated that the control 
station for the truck will transmit data to and from the 
vehicles through a RF modem. The control station is 
further composed of three modules: the control station 
computer, video control module and the control console 
module. 


4.3. Additional Features of ANC^US 

The primary ANC/EUS design goals are flexibility and 
resistance to obsolescence. To this end the system is 
modular, which allows it to be easily upgraded. The 
system also can be placed on most vehicle chassis with 
a minimal amount of redesign due to the unique 
personality module concept discussed above. The 
personality module accepts high level commands from 
the control station and implements these commands for a 
particular vehicle. 
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Figure 5 - Control station 


The ANCvCUS system also has the following features: 

• Up to 254 vehicles can be addressed on one 
network. 

• Up to 255 hardware modules (personality or 
application) can be installed on each vehicle. 

• Hierarchical vehicle safety built into network. 

• RF data link repeater vehicles are possible. 

• Modular vehicle system: 

uses onboard fibre optic data link for noise 
immunity; 

allows quick replacement of faulty hardware, 
allows modules to be placed close to 
controlled activity; 

allows easy installation of application specific 
modules; 

distributed processing increases vehicle 
processing power; 

• Uniform control code structure. 

• Application control codes require no software 
changes. 

• Temperature-controlled Vehicle Control Processor 
uses commercial components in adverse 
environmental conditions 

• Built-in video management 

• Can be efficiently Installed on most vehicle chassis. 


5, Related work 

There has been considerable work done on automation 
of both surface and underground mining. In North 
America, there appears to be more automation progress 
in underground mining than in surface mines since, in 
some underground mines, safety aspects give large 
incentives to develop driverless vehicles. The following 
are some of the key robotic truck research activities that 
have been identified and have relevance to this project: 

• INCO has developed an automated 70 ton 
haulage truck (AHT) that uses an "I" beam on 


the roof of a mine tunnel for guidance [1]. The 
AHT can haul, dump and return automatically, 
but to date some manual input is required for 
loading. The overall project started in 1983, 
and the AHT was commissioned In 1989, and 
has been running successfully since then. 
INCO is also working on an automated LHD 
vehicle, that is loaded using teleoperation, 
travels along a tunnel automatically, and is 
dumped manually [5], The LHD is guided by a 
reflective strip in the ceiling of the tunnel. Tests 
of the LHD have been quite successful. 

• Noranda As a joint project with INCO, 
Noranda is working on an automated LHD 
vehicle that can muck automatically. This LHD 
is fitted with many sensors to determine the 
vehicle health. 

• Potash Corporation of Canada, have 
automated a continuous borer type mining 
machine. A large number of sensors are used 
to detect ore grade and machine health. A 
laser beam is used to provide guidance in the 
tunnel. The machine can be operated in fully 
automatic, semi-automatic or in manual 
modes. 

• Defence Research Establishment 
Suffleld (DRES), have developed an ARGO 
8 wheel ATV for full automation and tele- 
operation [2]. The system was developed in a 
four month period to detect land mines. The 
control and sensor package was developed to 
be very rugged (near military specifications) 
and to be largely vehicle independent, with 
only one system requiring change to fit on 
another type of vehicle (such as a mining 
truck). No collision avoidance sensors were 
installed [8]. 
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• Caterpillar is developing a robotic vehicle. 
Little information is known about this work 
since it is proprietary to Caterpillar. 

• BHP in Australia has carried out work on 
automation of different mining equipment. 

• Kamatsu of Japan is researching a robotic 
truck but, according to the literature, the work 
does not appear to be as advanced as the 
ORES ANC/EUS system. 

• The Alberta Research Council (ARC) has 
been developing technology to support control 
of robotic devices in the presence of low 
communication bandwidths for the Canadian 
Space Agency [7]. This technology could be 
applied to the control of multiple robotic 
vehicles. 

• PRECARN Associates Inc. through its 
ARKS, IRIS, and IGI projects has made several 
technology developments that are applicable 
to the semiautonomous mining truck [3]. The 
proposed PRECARN MAP project also has 
objectives that are similar to the 
semiautonomous truck development and there 
is potential for a significant degree of synergy 
between the two projects. 

• The National Research Council of 
Canada has developed a number of 
technologies that are relevant to this study [4]. 
They include semi-autonomous robots, 
computer vision systems and other sensor 
developments. 

• CMU (Carnegie Mellon University, Pittsburg) 
has developed and tested a navigation system 
(FASTNAV) implemented on a 150 ton 
Caterpillar dump truck [6]. 


6. Current Status 

The robotic truck project is currently in the initial stages of 
development. Research related to collision detection 
and obstacle avoidance systems has begun and 


preliminary results are expected by mid-summer 1994. 
By fall of 1994 a detailed design concept for the 
prototype robotic truck is expected to be ready and an 
operation prototype ready for testing by winter 1995. In 
parallel with these technical developments the project 
team will also be examining the issues of integrating a 
robotic truck fleet into routine mine operations and 
developing strategies for bringing this technology to the 
international market. 
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Abstract 

A new method to stabilize a vehicle, which is autonomously 
guided on the road by an optical sensor, is presented. Since the 
dynamical behavior of a vehicle is nonlinear and contmns mutual 
couplings in the state variables, the design especially of algo- 
rithms for transverse control is based on the universal valid prin- 
ciple of nonlinear decoupling and control. This method works 
independent from any operating point and thus a high degree 
of accuracy and free pole assignment for the overall system is 
provided. 

1 Introduction 

In the last years many efforts have been achieved in realizing 
and integrating automation components in motor vehicles. 
In this context the task is followed to support the driver 
by efficient kinds of control and safety systems [1]. In this 
way anti block systems, methods to prevent a vehicle from 
skidding on icy roads [2] as well as the application of col- 
lision avoidance strategies are described [3]. Besides a very 
important aspect to increase safety in traffic is to direct a 
vehicle equipped with an on-board system automatically on 
the road [4]. Here a method is described, where the geometry 
of the street is detected by an optical system mounted on the 
car. Based on the information coming from this sensor the 
control system provides adequate control signals to stabilize 
the vehicle on the right lane. As the street has to be followed 
exactly, a high quality control system for the vehicle is re- 
quired. In this paper a very efficient strategy of transverse 
control based on nonlinear formulations is presented. Fur- 
thermore a longitudinal controller to influence the velocity 
of the vehicle is described. 


2 Mathematical model of the vehicle 

As already mentioned, the task is followed to develop a con- 
trol system, which directs the vehicle automatically along 
the road under consideration of a. nominal speed. The right 
edge of the street is traced by an optical sensor, e.g. a video 
camera. In this way the distance Ur between the point P on 
the optical axis and the point E on the edge of the road 
can be determined. By further consideration of distance Uo 
between point P and the center of gravity of the vehicle a 
proportion of the car’s direction of movement in dependence 
of the course of the road can be indicated. This structure 
is illustrated by Fig. 1, where especially in the lower sketch 
the kinematics of the vehicle including the optical sensor is 
shown. Please note that represents a constant parameter 
and, in contrast, Ur is a dynamic variable. 

To develop an efficient control system a mathematical 
model describing the dynamical behavior of the vehicle is 
a very important supposition. Besides the requirements of 
high accuracy this model, which consists of several differen- 
tial equations, should be of certain clearness. A single track 
model describing transverse and longitudinal dynamics, ne- 
glecting roll and pitch angles and comprising front and rear 
wheels to one fictitious wheel respectively is well suited [5]. 
It is transferred into state space description, which will be 
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output vectors u(t) and y(^) are introduced as follows: 



Inputs are front side force 5^ of the vehicle depending on the 
steering angle by 


and rear driving or braking force H, while output variables 
are the yaw angle describing the orientation of the vehicle 
and the velocity v of the car. As shown in Fig. 2, impor- 
tant state variables are the yaw angle the yaw velocity 


Figure 1: Mode of operation of the sensor 


the starting point for the design of the path control system: 
r ^ 1 T(x) XI 1 5,(x) 1 


Figure 2: Dynamical variables of the vehicle 


which consists its first derivation, the longitudinal velocity v 
and the sideslip angle /?. The vehicle is of mass m and of mo- 
ment of inertia while c^, c/i, /„ and are further constant 
parameters. The variables 5/i(x) and T(x), which consist of 
the rear side force and the air resistance respectively, depend 
on their turn on the state variables and can be computed by 


Aerodynamic resistance coefficient, atmospheric density and 
surface of the vehicle’s cross section are represented by the 
parameters c^y, and A. To keep clearness of the model 
equations for the derivation of the control laws these vari- 
ables are not inserted explicitly. In this way also the front 
side force is stipulated as an input variable of the model. 

Furthermore the actual position coordinates A"' and Y of 
the center of gravity in the stationary system is evaluated 


The 4-dimensional state vector x(0^ which consists of the 
dynamical variables, as well as the 2-dimensional input and 
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with respect to the differential equations 

X = V cos(V> - /3) 

(5) 

Y = V sin(ip — 0). 

However, the coordinates Xp and Yp of the point P on the 
optical axis, which is of distance lio to the center of gravity 
are determined by 


Xp = X + lio cos xp 

(6) 

Fp = X + lio smxp. 

Please note that equations (5) and (6) are only required for 
the simulation. As shown in section 3, however, they are not 
required to develop the control algorithms. 


3 Nonlinear Path Control 

3.1 Design of the control algorithms: 

The dynamical behavior of a vehicle is extremely nonlinear 
and consists of mutual couplings. Control laws, which are 
based on a linearized model, would have the disadvantage 
of dependence from an operating point. However, if the de- 
sign of the control laws is based on the universal method of 
nonlinear decoupling and control, all couplings and nonlin- 
earities are compensated in a direct way and the controlled 
vehicle will be impressed a linear behavior with free pole as- 
signment [6]. Based on state space description (1) the task 
is to find algorithms to control the vehicle’s yaw angle xp 
and the velocity v by influencing front side force S^, and rear 
longitudinal force H. 

The elements Ci(x) and C 2 (x) of the matrix C(x), rep- 
resenting the coordinates of the car’s center of gravity, are 
related to a subsystem each [7]. For both subsystems first the 
differential orders, which determine the order of the output 
variables’ derivations connected in a direct way to an input 
variable, have to be found out respectively. To determine the 
value greater zero for the subsystem i the differential oper- 
ator N^Ci{x) with k = 0,1,2,..., which can be recurrently 
evaluated by 

N^Ci (X) = [|j iV^>C.(x)]A (x) (7) 

under consideration of the initial value 

KCi (x) = C,(x), (8) 

is introduced. Hence the differential orders of the subsystems 
are evaluated by the following formulation: 

di = min [j : (x)] B(x) ^ 

“ 'I (^) 

; = 1,2,. ..,n| 

with 0 as the zero vector. With respect to state space de- 
scription (1) the differential operators X^C't(x) as well as the 


parts A^^C.(x)]B (x) are evaluated for subsystem ^ = 1 
to ” 


iV“C,(x) = 

^5C^i(x)]b (x) = [O O] 
N\Ci(x) = X3 

[| A^iC,(x)]B (X) = 0], 


(10) 


( 11 ) 


Furthermore for subsystem i = 2 these parts yield 


Af^C2(x) = X, 

|| (vSftwIfi to . |o i|^ 

As for subsystem i = 1 part (x) = 0^ but 

part X^Ci(x)]b (x) ^ 0^ and for subsystem z = 2 al- 

ready part X^C 2 (x)] B (x) ^ 0^ the differential orders di 
for both subsystems result in 


= 2 

d2 = I . 


(13) 


These values for the differential orders signify the immedi- 
ate effect of the system inputs on the second derivation of 
the yaw angle and the first derivation of the car’s velocity. 
As the output variables consist of the yaw angle, which is of 
kinematic character, and the velocity representing a kinetic 
variable, one receives different values for the differential or- 
ders of each subsystem. 


For further consideration also the operators 

^lC.(x) = 

N\C2(x) = -^xl 


(14) 


have to be calculated. Thus the 2x1- and 2x2-matrices result 
in 


C*(x) 


(x) 

(x) 


Sh h 



m 


2 

4 


(15) 


and 


D*(x) = 


4 (i) 

^ JVf-'C, (5) 



■ k 

0 

B(x) = 

0 

1 


0 

m . 


(16) 

respectively. These matrices are basic requirements to derive 
the nonlinear control algorithms which satisfy the universal 
equation 


u(<) = D*(x)-*{-C’(x) + A w(0-M*(x)}, (17) 



where the control signals u(<) are generated in dependence 
from the state variables x(f ) and the nominal values w(^). As 
the elements of \v{t) correspond to the elements of y(i), the 
nominal values for yaw angle and velocity are determined by 
wi(t) and W 2 {t) respectively. By the 2x2-matrix A, composed 
only by elements on the main diagonal, the nominal values 
are amplified, while by the 2x1 -matrix M*(x) po\e assignment 
for the overall systems is carried out. 

The existence of the inverse of D*(x) is a necessary and 
sufficient condition for the decoupling of the system of dif- 
ferential equations (1). With respect to (16) the determinant 
of D*(x) results in 



whereby the inverse is calculated to 


0 


0 
I 

0 m 


(19) 


To complete the control laws the matrices A and M*(x) have 
to be specified explicitly. By 


A = 


diagjAi, Aj} 


Ai 0 

0 A 2 


(20) 


the nominal values are weighted with respect to the amplifi- 
cations Ai and A 2 , while by 


E* Kc,{x) 

k-o 


aoi ^2 + <^11 ^3 

E iV^C2(x) 

. k =0 


ao2 X 4 


M"(x) = 


(21) 

the dynamical behavior of the decoupled subsystems can be 
adjusted by the constant parameters aoi, <^02 S'lid an. 


With respect to the universal formulation (17) and the 
corresponding matrices in (19), (20) and (21) the control 
laws are evaluated to 


Ui{t) = ^ 5 h(x) +J- (Ai Wi(t) - doi - an X3) 
U 2 {i) = T(x) +m (A 2 W 2 {t) — ao 2 x^) . 


( 22 ) 


As the control algorithms require the actual values for 5/i(x) 
and T(x) of rear side force and air resistance, these variables 
are provided with respect to equations (4) in advance. 


3.2 Dynamical behavior of the overall system: 

With respect to the control algorithms (22) the dynamical 
behavior of the overall system is examined. Considering (1) 
and (13) these derivations yield 


y\ = ^3 
V2 - 2^4. 


(23) 


variables yi and y 2 have to be derived respectively. Replac- 
ing the first derivations is and i 4 by the corresponding right 
sides of (1) and by further substitution of the control vari- 
ables Ui{t) and U2(0 right sides of (22) the overall 

behavior of the controlled vehicle results in 


yi + in + otoi yi - Ai tyi(t) 

2/2 + 0^02 2/2 = A2 n^ 2 (^)- 


(24) 


By these equations the overall dynamics of the vehicle de- 
scribing in transverse direction a behavior of second order 
and describing in longitudinal direction a behavior of first 
order is determined. Desired variables are u^i(f) for the nom- 
inal yaw angle and 1 ^ 2(0 desired velocity of the car. 

With the free choosable parameters Ai, A 2 , aoi, an and ao 2 
the dynamics of the controlled vehicle can be adjusted. It is 
sensible to introduce the following restrictions: 

aoi = Ai 

an = 2 (25) 

ao2 = A 2 . 

In this way considering the stationary case the nominal val- 
ues and the corresponding output variables are weighted by 
the same parameters. Furthermore the aperiodic borderline 
case is adjusted in transverse direction. 

With the remaining degrees of freedom consisting of the 
parameters Ai and A 2 rapidity of control circuits can be in- 
fluenced by pole assignment. Here technical constraints of 
the vehicle have to be taken into consideration. 


3.3 Modification of the nonlinear control laws: 

As the value of the yaw angle, represented by variable X 2 
in (22), is not measured and also a nominal value lui(f) for 
the yaw angle does not exist explicitly, an adequate modifi- 
cation of the relevant control algorithm is necessary. Rather 
the task is followed to find a control system, which adjusts 
automatically the car’s direction of movement considering 
the course of the road. In this context the edge of the street 
is realized by the optical sensor mounted on the car. Ori- 
entations of the car and of the sensor are identical. So a 
control method must be found to adjust the yaw angle in de- 
pendence of the signal coming from the optical system. Due 
to these requirements in the control law for the yaw angle, 
which is represented by the first equation of (22), the vari- 
ables X 2 and iy;i(/) have to be eliminated. In this way first 
the parameters Ai and aoi are equated to 

Ai = aoi, (26) 

so that the control error Wi{t) — X 2 can be factored out. The 
task is to substitute the control error in an adequate way. 
The deviation between the optical axis of the sensor and the 
edge of the street is determined by 

hr 

(j) = arctan-—. (27) 

ho 


In this context di and c ?2 indicate, how often the output 


As the control system has to direct the vehicle on the right 
lane with nominal distance wur to the right edge of the road, 
an offset 

w^{i) = arctan — ( 28 ) 

ho 

has to be taken into account. By equation 

tei(<)-x2 = uv(0-<^ (29) 

the control error is substituted by the difference w^{t) - <p. 
1 his relation is illustrated by Fig. 3. If further the parame- 
ters A 2 and 0 !q 2 are equated the control laws (22) consider- 
ing (29) are transformed into 

uiit) = — «^/i(x) — <l>) — J’a) 

ly ) ( 30 ) 

^2(0 = ^(x) -frn (A2 (w^ 2(0 “ -2:4)). 

Herewith based on the present kinematic structure and the 
available sensors transverse and longitudinal dynamics are 
controlled. The desired values are represented by the nominal 
distance of the path of the vehicle to the right edge of the 
street and by the nominal velocity. 



Figure 3: Determination of the transverse control error 


3.4 Processing of original control signals: 

By equations (30) the required values of the control variables 
for front side force Sy and rear driving or braking force H are 
computed. As the course of a vehicle is determined by the 
steering system, an algorithm is required to generate sub- 
sequently the corresponding value for the steering angle by 
representing an original control signal in dependence of front 
side force Sy. In contrast control signal H for driving or brak- 
ing force is correlated directly to the vehicle and thus it exists 
already as original control signal. For better distinction the 
original control signals ui and U 2 9 'i‘e marked by a bar. 

Based on equation (3) and with respect to (2) the original 
control signal Ui is generated in dependence of the value Sy 
for front side force, which is represented by ui and evaluated 


by control law (30). In contrast the original control signal U 2 , 
which is identical to the desired value H for the rear driving 
or braking force, is represented by control variable U 2 - 


Ul — Oy — Xl -f" ly 

Cy 

U2 — H = U2. 


(31) 


In this way the original control signals u(/) are generated 
in a tandem arranged unit in dependence of vector u(/) and 
state variables x and subsequently they are transferred to 
the actuators of the vehicle. 


3.5 Simulation results: 

To demonstrate the quality of the control system, the dynam- 
ical behavior of the controlled vehicle was examined by com- 
puter simulations. Vicarious in Fig. 4 the resulting manoevre 
based on a sudden 45°-bend of the right edge of the street, 
w'hich represents a hard test for the control system, is shown. 
This bend can be recognized by the graph marked with ’E’, 
which consists of the fictitious path of the point E (see also 
Figs. 1 and 3). In contrast by the graph marked with T’ 
the in the same way fictitious path of the point P, which is 
located on the optical axis of the sensor system and which 
is here of distance //o = 5 m to the car’s center of gravity, is 
shown. It can be clearly recognized, how the vehicle, which 
drives a speed of 50 km/h, turns in this way that the con- 
trolled condition to keep the nominal distance of wur = 2 m 
between the points P and E is followed. With respect to (25) 
the path of the vehicle’s center of gravity drives through the 
bend with an aperiodic behavior and follows the street in 
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x-coordinate [m] 


Figure 4: Dynamical behavior of the controlled vehicle 



an exact way. Furthermore the course of the original con- 
trol signal for the steering angle is described by the graph 
marked with ’1’ in dependence of the actual position of the 
car on the x-coordinate. 


4 Conclusion 

An on-board system for transverse and longitudinal control 
of an automatically guided vehicle based on nonlinear decou- 
pling strategies is presented. With respect to the transverse 
dynamics of the car the control signal for the steering system 
is generated in dependence of the information recorded by an 
optical sensor, which detects the edge of the road. In contrast 
the drive and the brakes are influenced by the longitudinal 
controller taking the nominal and the actual speed into con- 
sideration. Control signals for the steering system as well as 
for the drive and and the brakes of the vehicle are gener- 
ated by nonlinear algorithms. This method is especially dis- 
tinguished by its efficiency and its preknowledgement about 
the dynamical behavior of the overall system. Herewith an 
important requirement to increcise safety in future traffic is 
realized. 
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Abstract 

Research in multiple, robotic agents is gaining the 
interest of an ever increasing number of researchers. 
Many of these researchers have previously worked in 
simulation or with single robots, or both. Making the 
transition from a simulator to the real world can be 
very trying and frustrating to someone with no expe- 
rience with such a project. The same goes for making 
the transition from a single robot to multiple robots. 
There are a number of issues that arise, mostly of the 
practical and pragmatic variety, that escape consider- 
ation by researchers making these transitions for the 
first time. We hope to highlight the most important 
of these issues - discovered primarily through expe- 
rience with working on multi-robot projects, two of 
which are discussed in the paper - so that other re- 
searchers can give them full consideration when work- 
ing on their own projects. In addition, we give some 
suggestions as to how to eliminate or minimize the 
negative impact these issues might have upon the de- 
velopment of a multiple robot project. 

Introduction 

A time will come when it will be common to see 
autonomous robots working together in teams or in- 
teracting as individuals. Each of these robots will be 
performing its specific role in achieving whatever it 
has been given as tasks. Individual robots, regardless 
of whether it is working in a team or not, will dy- 
namically interact with each other, the environment, 
and with humans. They will communicate necessary 
information in noisy environments, fill in for fallen 
comrades, and adapt to the temporary loss of sensor 
subsystems. This scenario is still a long way in the fu- 
ture. What will it take to make this a reality? While 
research in Robotics and in Distributed AI is always 
pushing toward this future, research is still in its in- 
fancy compared to what is necessary before robots 
can function as described above. 

Quite a bit of research has been done regarding 
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issues related to multiple agents,^’ and some has 
been done specifically for multiple robots.^’ How- 
ever, none of this work has really looked at what a 
researcher faces when trying to implement his or her 
ideas on real robots for the first time. In this paper 
we present a number of issues that arise when mak- 
ing the transfer from simulated, theoretical, or single- 
robot research to working with more than one real, 
autonomous, robot situated in a real world. We dis- 
cuss each of these issues in some detail and give sug- 
gestions, based on experience, for dealing with them. 
In the first two sections we discuss the issues related 
to working with more than a single robot and those 
inherent to working with robots situated in the real 
world, respectively. We then describe concrete exam- 
ples of the problems faced when working on multibot 
projects. Throughout the paper we give suggestions 
on what can be done to eliminate or reduce these 
kinds of problems. 

Multibot Issues 

A number of issues arise when working with mul- 
tiple robots. We divide these roughly into the issues 
that arise when looking at the collection of robots 
as a whole, and those issues that arise when looking 
at the individual robots that make up the collection. 
Many of the “collective” issues (such as those that 
deal with communication, organization, cooperation 
strategies, etc.) have been addressed in Distributed 
AI (DAI) research and tend to be fairly abstract in 
their nature. We talk briefly about these issues but we 
do give pointers to where more in-depth discussions 
can be found. Issues that arise from the collection 
of “individuals”, primarily due to the heterogeneity 
between the agents, seems to be a topic of research of 
interest to a great number of fields of study (robotics, 
DAI, artificial life, etc.) but to no one field in par- 
ticular. In this section we discuss the issues that we 
see are the most significant to researchers working 
with collections of these physically embodied agents 
(robots). 
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Heterogeneity 

Heterogeneous het.er.o.ge.ne.ous , adj. Consisting 
of or involving parts that are unlike or without in- 
terrelation; having dissimilar elements; not homo- 
geneous. [< GETERO- + Gk genos, kind, sex.] 
-het’er*o*ge*ne’i*ty n. 

Heterogeneity among a collection, group, or team 
of robots is a BIG issue. In fact, it may be the sin- 
gle most important issue for researchers making the 
transition from simulation or theoretical work to con- 
sider. Research performed in simulation seldom lacks 
the completeness required to fully model the differ- 
ences between robot platforms that will serve as the 
real-world implementation. Very small differences be- 
tween simulated robots, which appear insignificant to 
the uninitiated, can become overwhelming when real 
robots are pressed into service. We identify a number 
of factors to especially watch out for (with respect to 
differences in the robotic platforms) in order to make 
the transition to real robots easier, and to reduce the 
potential impact upon various aspects of the multiple 
robot system if the differences are not eliminated or 
reduced. 

Robots come in an incredible variety of sizes, 
shapes, and capabilities. Robots can be arms, mobile 
bases, gantries, snakes, or any of a number of other 
alternatives. This richness of design makes for a wide 
range of applicability of robots to different domains, 
environments, and applications. It is also a major 
source of grief for anyone wishing to do research with 
more than one of these robots. There are a great num- 
ber of places where heterogeneity can cause problems. 
We divide these into innate and non-innate charac- 
teristics, discussed below. 

Innate 

We consider innate characteristics of a robot to be 
those features that a robot is “born” possessing, those 
which are inherent to a robot’s basic design and is 
generally determined by the manufacturer. These in- 
clude: physical characteristics such as weight, size, 
and shape; precision and/or accuracy of such things 
as odometry, positioning (e.g. robot body, camera, 
etc.); modality and number of sensors; characteristics 
of the low-level control such as dynamics, function- 
ality, interfacing; design limitations and characteris- 
tics such as holonomic characterization, the number 
of degrees of freedom, bounds on speed, acceleration, 
reach, etc., and carrying capacity; and the number 
and type of actuators and/or manipulators. 

Many of these features are either impossible, or 
very difhcult, to change, remove, or replace, and are 
a major boon and bane of robotics researchers. A 
robot that comes with a powerful, flexible, and com- 
plete set of innate “features“ can be greatly advan- 
tageous. And conversely, a poorly designed robot, or 
one that may be designed well but ill suited to the 
task to which is applied, can be a nightmare. 

Non-innate 

We call the features and characteristics of a robot 
which are in the control of the roboticist the non- 
innate features of the robot, those which are a re- 


sult of work done on the robot to add to or change 
the functionality of the robot after it arrives from the 
manufacturer. This includes such things as: the num- 
ber, modality, precision, etc. of sensors; the number 
and type of actuators and manipulators; the number, 
power, memory, connectivity, etc. of processors; the 
programming language; the high-level control scheme 
(if any, which would then include the high-level con- 
trol interface to the low-level controller); the inter- 
agent communications modality and characteristics; 
and “sugar” features like speech synthesis and recog- 
nition capabilities, graphics displays, etc. It might 
also include those innate characteristics that can be 
modified, as there may be some fuzziness to the dis- 
tinction. There is generally a greater variation in the 
non-innate features of a robot than in the innate fea- 
tures, due to the wide range of add-on and upgrade 
possibilities, including “homemade” designs. 

The problems 

Heterogeneity is an inherently multi-agent issue, 
as it is defined as the existence of differences between 
two objects, in this case robots. Heterogeneity arises 
from both the innate and non-innate features of the 
robots, and may be looked upon as an advantage in 
situations where the heterogeneity can be exploited. 
However, the differences between robots can, and usu- 
ally does, eventually cause problems. The problems 
associated with innate and non-innate feature can be 
very similar, but may possibly have very different so- 
lutions (as discussed below). As mentioned earlier, 
heterogeneity between the robots might very well be 
ike most important issue to be faced by researchers 
working with multiple robots. Our empirical intuition 
is that the difficulty of implementing and maintain- 
ing a collection of multiple robots is a function of both 
the heterogeneity and the number of robots. We be- 
lieve that the relationship is something like that of 
Figure 1. As you can see, we believe that the diffi- 
culties associated with increased numbers of agents 
increases at a higher than linear rate. We believe 
the same follows for heterogeneity. Of course this is 
totally unsubstantiated, and is based solely on past 
experience with multibot implementations. 

The problems caused by heterogeneity usually man- 
ifest themselves not in the actual experiments con- 
ducted by the researcher, but in the development 
stage of the research, where the robots are being read- 
ied for the experiments. The development period usu- 
ally serves the purpose of dealing with the differences, 
either to avoid them or to take advantage of them, so 
that when the robots are ready to run experiments the 
issues have already been considered and addressed. 
While designing and implementing the robots’ sen- 
sors, control systems, processing hardware, coordina- 
tion scheme, etc., a researcher may face problems in 
any of a number of areas, which we have divided into 
three broad categories: software, hardware, and func- 
tionality. For each category we describe the source 
of problems that can occur and their effect upon the 
development of a multibot system. 

• Software - Software on the various robots in the 


621 


DWteuHy 



Figure 1: The relationship of difficulty of imple- 
mentation and maintenance to hetero- 
geneity and the number of robots. 


“collection” may be affected by differences between 
any of a number of robotic characteristics, includ- 
ing the processors, compilers, programming lan- 
guages, sensors, speed, development environments, 
and third-party software libraries of the various 
robots. Any difference in these, or any other of the 
innate or non-innate features, may create the ne- 
cessity to modify software to suit a particular robot, 
which will make the robot all the more heteroge- 
neous. Research agendas themselves may force dif- 
ferences in the software systems utilized by different 
robots, such as requiring different control architec- 
tures or obstacle avoidance algorithms, in order to 
study the tradeoffs associated with them. Differ- 
ences in software may range from changed param- 
eters, to modified code, to different software mod- 
ules, to completely different software systems. Re- 
gardless of the source and extent of the heterogene- 
ity, once the differences occur it can be a nightmare 
to make changes across all of the involved robots to 
account for each robot’s idiosyncrasies. 

Hardware - Robot hardware may differ in sensors, 
mechanics, physical dimensions, dynamics, CPU’s, 
equipment storage volume, etc. This may be a 
result of having purchased the robots at differ- 
ent times, implementing different sensor system de- 
signs, replacement of broken equipment with non- 
original parts, etc. Robots that are dissimilar in 
hardware may or may not create problems; If the 
hardware on different robots is not equivalent, in 
that there are enough differences in functional- 
ity, modality, speed, etc. to not be transparently 
switchable, software problems like those discussed 
above will most likely be created. And other dif- 
ficulties may also arise, such as having to gain ex- 
pertise on more and more varied equipment and 
maintaining the various robots’ different hardware. 

Functionality - The capabilities that a robot has 
depends upon the combination of hardware and 
software that it has. Given a robot with a par- 
ticular hardware configuration, the robot can have 
a range of functionality, depending upon the soft- 


ware written to use the hardware. Likewise, given 
control software and sensing algorithms, the robot 
can have varied functionality dependent upon the 
characteristics of the actual sensors, manipulators, 
drive motors, and other hardware that the robot is 
fitted with. Heterogeneity in any aspect of a robot, 
be it sensing, control, motion, manipulation, or 
some other aspect, creates a situation where the re- 
searcher must make a decision about the function- 
ality that he/she wants the robots to actually pos- 
sess. Emphasis may be on having all robots possess 
the same functionality, or it might be desired that 
the robots possess the maximal functionality pos- 
sible. Choosing the latter, while understandable, 
causes more heterogeneity than the former"^, and 
hence possibly exacerbates future problems similar 
those items discussed above. 

Suggestions 

The single most important suggestion that we can 
make to researchers is that they reduce the amount 
of heterogeneity in the robots that they work with. 
Heterogeneity between robots is probably the single 
largest source of problems, effort, and grief encoun- 
tered while working on research. Eliminating all dif- 
ferences between robots would be ideal, of course, but 
is not always possible. Robots from different manu- 
facturers will certainly have differences in innate char- 
acteristics, as will different models of robots from the 
same manufacturer, as will even the same model robot 
from different years. However, these differences can 
be eliminated ai some level of absiraciion^ and it is 
our suggestion that an effort should be made to ac- 
complish this. 

For example, if two robots differ in their low-level 
motion control functions, a set of higher level func- 
tions can be built on top of these commands that 
removes the robot-dependent aspects. Code written 
using this new set of functions can then be readily 
ported between robots. 

Of course, dealing with heterogeneity is an interest- 
ing research topic, and is therefore necessary in some 
situations. But it is our belief that it is much easier 
to introduce differences in robots by disabling func- 
tionality or changing parameters (as examples) than 
it is to eliminate or reduce differences. 

Communication 

When we talk about communication among robots 
we mean the intentional act of trying to convey infor- 
mation. And, while communication may not explic- 
itly be used by some researchers,^’ ^ it is very 
common.^’®’ Communication between robots is 
most commonly accomplished using some form of ra- 
dio frequency (RF) transmission, although it might 

* Unless all the robots are exactly the same in all re- 
spects, so that their maximal functionality is exactly iden- 
tical and all the software and hardware required to reach 
this functionality can also be identical. If the robots are 
not exactly the same, the heterogeneity will show up in 
the software, at least, in order to achieve the same func- 
tionality, if this is even possible. 


eventually become possible to explicitly pass mean- 
ingful amounts of information by visual means. Teth- 
ers or other physical links will most likely not work 
except for robots firmly fixed in place, such as robotic 
arms that are not on mobile bases. 

Communication can be accomplished in a num- 
ber of ways, including simple point-to-point and 
broadcasting (ie. to all robots within range). Com- 
plex multibot communication networks can be con- 
structed, however, where robots may not only act as 
recipients and originators of messages, but also as re- 
lays, helping pass messages between two other robots. 
As more robots interact, communication issues be- 
come more and more of an important issue. 

Communication issues are unique to multiple robot 
scenarios (if only because it generally does not make 
too much sense for a robot to send messages to itself); 
problems related to communications are therefore also 
unique to multiple robots.^ Through our endeavors 
in multi-robot research, we have identified a number 
of the problems that seem to plague communication, 
and we have identified some practical suggestions to 
at least reduce these problems. Some of the more 
significant problems are listed below: 

• Missing messages - messages never get to their des- 
tination. 

• Wrong messages - the wrong message is sent, an 
agent intercepts a message meant for another agent 
and mistakenly takes it to be for itself, or a message 
header gets corrupted in transmission and is sent to 
the wrong agent. 

• Garbage contents - a messages information is cor- 
rupted to the point of uselessness. 

• Communications hardware failure - an agent suffers 
total loss of ability to communicate. 

• Transmission delays - A message’s arrival is delayed 
due to length of travel, number of relaying robots, 
etc. 

All of these problems are caused by RF noise either 
corrupting or overpowering the intended communica- 
tions. The magnitude of the problems one will face 
is directly related to how noisy the RF environment 
is in which the robots will be used, how robust the 
low-level communications hardware and software is to 
corruption (via error checking and correction, hand- 
shaking, etc.), and how robust the abstract coordina- 
tion mechanism is that the robots are using in order 
to work together (higher level protocols, negotiation 
schemes, etc., if used at all). 

Some more abstract issues related to communica- 
tion, many of which have been studied in Distributed 
AI literate, include common knowledge, synchroniza- 
tion, and coherence. Working with real robots means 
that there is always a chance that a message will not 
be received by the intended robot, or that if it is, 

^ Communication from a single robot to a base station 
is pretty common these days, so those researchers that do 
this will have some insight into multi-robot communica- 
tion problems. 


that it is corrupted. Halpern and Moses, prove 
that the involved agents cannot be sure of achiev- 
ing common knowledge about anything that requires 
communication in such situations. Synchronization 
of agents is related to this in that, quite often, com- 
munication is used by the agents to reach a common 
point in time at which they know each other’s “state” 
(and can then go on to perform coordinated activities, 
guaranteed non-interfering actions, etc.)^^’^^ Syn- 
chronization is usually only possible, however, when 
common knowledge of every involved agents’ state ex- 
ists so that they can realize when synchronization has 
been achieved. Coherence deals with coordinating 
agents having compatible and non-contradictory in- 
formation. Coherence can be achieved through com- 
munication of the data itself, supporting or conflicting 
evidence, etc. so that each agent eventually believes 
compatible information. 

Of course, if the interacting robots are unconcerned 
with explicitly coordinating with other agents they 
will most likely not communicate (as in^), and there- 
fore not reason about these communications-related 
issues. 

Suggestions 

Solutions to deal with communication problems are 
pretty commonplace. Technical solutions for these 
problems include retransmission of messages, seman- 
tic message content checks, acknowledged messages, 
periodic confirmation of activity (“Fm alive!”) mes- 
sages, addressed messages, and robust error detection 
and correction protocols, among others. Different 
techniques are necessary for variations of domain, ap- 
plication, robot organization, environment, etc. For 
instance, in extremely noisy environments it might be 
necessary to employ error detection and correction 
mechanisms, retransmission of messages, and hand- 
shaking protocols. When the robots are prone to fail- 
ure, but the environment is noise-free, using simple 
communication protocols might suffice, but periodic 
messages from agents indicating that they are func- 
tioning might be useful. In general, design in com- 
munication overkill. Buy high power, flexible, high 
quality communication hardware. Determine what 
will be the worst possible environmental noise that 
the robots will face, and then employ techniques dis- 
cussed above for environments twice as noisy. 

Planning, Organization, and Task decomposition 

Issues related to planning, organization, and task 
decomposition, among other abstract multi-agent is- 
sues, are beyond the scope of this paper, and much 
research has been conducted on these topics in the 
Distributed AI field. See “Readings In Distributed 
Artificial Intelligence”^ for a collection of papers deal- 
ing in detail with these issues. What should be men- 
tioned here, however, is that each of these issues may, 
in some manner, be affected by the issues discussed 
in the rest of this paper. Some examples are given 
below: 

• Heterogeneity - Multiagent planning routines will 

have to be modified to deal with heterogeneous 
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robots in order to model each of the robots’ indi- 
vidual characteristics. As the robot’s might differ 
in any of a number of ways, this might mean adding 
a large amount of complexity to the planner. Or- 
ganization(s) of robots will be less flexible than for 
homogeneous robots, as each agent may not be able 
to fulfill the responsibilities of every other agent in 
the organization. Much like the planning system, 
task decomposition might be much more complex 
in order to account for the differences in the robots. 

• Communication - Communication issues can be ex- 
pected to work with the equipment and protocols 
that are going to be used, what type of task de- 
composition makes the most sense given the envi- 
ronment and domain 

Real World and Simulation Issues 

Working with robots in the real world may, at first 
glance, seem to be an easy extension of similar re- 
search performed in simulation. And, while simu- 
lators are good for testing theories, debugging de- 
signs, or just running tests due to environmental con- 
straints, a simulator may give the false impression 
that the real World is simple and predictable. How- 
ever, the world is not exact, and is much more com- 
plex then any simulator can realistically model. How- 
ever, they should not be considered satisfactory mod- 
els in which to completely design and develop algo- 
rithms and paradigms destined for real-world robotic 
applications. Toward this end, some researchers have 
totally foregone the use of simulators, and have opted 
to use the world as its own model.^ We should only 
rely on simulators to help us prepare a robot for the 
real world. Though we agree that simulators can be 
a valuable tool to supplement research with physical 
robots,^® at some point in the development cycle it 
will become necessary to make the transition to the 
real world. This may not be such an easy task for 
there are many issues that need to be considered. We 
divide these issues into three categories. These are: is- 
sues that deal with the robot hardware and platform, 
issues that deal with robot sensors and actuators, and 
issues that deal with robot software. 

Hardware 

The first issue, robot hardware and platform, is ar- 
guably the area with the most substantial differences 
between simulation and the real world. Most simula- 
tors do not simulate real world events such as when 
the battery gets low, when computers and sensors on 
a physical robot fail or start to misbehave (usually 
without one being aware of the fact), when motors 
degrade or burn out, or when gears or wheels slip or 
break. There is a great amount of hardware, all of 
which, usually, has to work flawlessly. There is hard- 
ware to control the motors, hardware to control the 
sensors, and usually a central computer. There is also 
minor hardware appliances such as batteries, power 
converters, actuator circuitry, and monitors that may 
need to be on board. 

An issue that may be overlooked quite often is the 
actual space requirements of sensors and other hard- 


ware on a robot. While a simulator may simulate 
the functionality of each of the physical components, 
it probably does not simulate how to place all these 
components onto a robot base while keeping it sta- 
ble, usable, and accessible. Robots in a simulator will 
have an array of simulated sensors, perhaps covering 
a gamut of modalities such as sonar, vision, range 
imaging, structured light, and infrared. The robots 
may have to communicate with each other. They may 
have manipulators of various sorts. A simulator may 
permit a all of these capabilities on a robot without 
consideration to the practicality of doing so. 

The type of robot platform or base that the robot 
is built on will is another important issue to consider. 
If a base is purchased from a manufacturer, one may 
not need to worry about the base design, but one 
does need to worry about the ability to add hard- 
ware and software to the robot (i.e. change the non- 
innate features). Consideration must also be given 
to such details as the base drive design, which can 
take a wide variety of forms, from differential drive 
or synchro-drive, to a tricycle-like or car-like design. 
The different drive systems may mandate some design 
decisions on the type and use of sensors, the type of 
robot control software used, the planning system, etc. 

Power constraints are another serious issue often 
overlooked when dealing with real robots. A great 
deal of a robot’s life is spent charging its batteries, 
the more so if it is heavily loaded with sensors, ac- 
tuators, and other electrical equipment. Also, the 
size and number of batteries restricts the amount of 
power (current) available, strictly limiting the amount 
of electrical equipment that may run concurrently on 
the robot. Even when within this limit, the battery 
life of a robot is determined by the load on its batter- 
ies while running. A robot that must operate over a 
long period of time must therefore have a light com- 
plement of electrically driven sensors, actuators, and 
other electrical equipment. 

Sensors 

The second issue to consider are the sensors that 
a robot might use in the real world. One must con- 
sider the kinds of sensors that will be necessary for the 
robot to perform its task and how many sensors the 
robot is going to need. A simulator can only approx- 
imate the data returned from a sensor based upon 
its internal sensor model. The more accurate or com- 
plex the sensor , the more sophisticated the model will 
have to be. As an extreme point of view, it may not be 
possible to realistically simulate real sensors (except 
perhaps extremely simple sensors like limit switches) 
in a simulator due to the complexity and uncertainty 
of the real world. No one can can anticipate all of 
the possible ways that a sensor may fail and/or err. 
Problems that can (and do) occur include cameras 
that are not calibrated to specifications, lenses that 
are dirty or misaligned, sonars with shorted circuitry, 
incorrect sensor values due to low input voltage, and 
motor decoders that perform differently than speci- 
fied or degrade in performance over time. 

Another issue to consider is whether the sensors 
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modeled in simulation return realistic data. For ex- 
ample, a simulated robot may have a sensor that can 
return the identity of a nearby human. Rut, one must 
ask, is there a sensor in the real world that can do 
that? Perhaps, but probably only at the cost of a 
great deal of design and implementation effort, and 
probably one that is quite fallible. 

Software 

It may be quite a formidable task to track down 
a software error on a real robot. In a simulator, the 
only place an error would normally arise would be in 
the user developed code. In the real world, one also 
has to consider that an error that might cause the 
robot to fail or perform differently from the simulator 
may be the result of a sensor failure, low battery, com- 
puter failure, cable problem, or some other seemingly 
unrelated cause. 

Depending upon how carefully the software was de- 
signed with the real world in mind, the software run 
in simulation will most likely run in the real world. 
When tested in the real world, however, one may find 
that algorithms need to be changed, libraries modified 
or rewritten, parameters changed, etc. When making 
the transition to real robots, one must also consider 
the size of the software and speed of the robot’s pro- 
cessor. Porting code to a minirobot will require great 
creativity if the software was developed on a Cray su- 
percomputer and requires 128 megabytes of memory. 
Due to the increased complexity of the real world with 
respect to that in a simulated model, robot tasks will 
always be harder than that faced in a simulator. It 
may require more subtlety or complexity in the plan- 
ning and control software, more sensors and compu- 
tational resources, or integration of software not in- 
cluded in the simulator model, to accomplish even a 
simple task. 

Suggestions 

To ease the transition of a robot design from a sim- 
ulator to the real world we offer several suggestions. 

♦ Expect that some additional code will be needed 
on the physical robot. Some helpful test programs 
that will test each component to assure that it is 
working properly, whether it is a sensor or a sub- 
routine, will be useful. After three weeks of running 
a camera, for example, it may have been bumped 
out of alignment or had its aperture closed so that 
it is causing strange and unexpected results in the 
robots behavior. Hardware problems can have very 
misleading symptoms and it is good to have test 
programs that can easily rule out simple causes. 
And always check connectors, as they are apt to 
become disconnected. This is a very common prob- 
lem, as everything on the robot uses cables (cam- 
eras, disk drives, monitors, keyboards, etc.) and 
simply the vibration caused by the robot moving 
around is enough to loosen cables. 

• A useful place to look when a problem arises is 
the battery. A low battery may cause errors that 
did not previously exist and be of a type that has 
never been encountered before. A robot may work 


correctly for months and then develop unexpected 
errors because the battery is running low. 

• It always helps to work in stages. Test eaph compo- 
nent and routine as it is moved from the simulator 
to the real robot(s). 

• Document the errors that are encountered. One 
does not want the next person working on the 
robot(s) to repeat the same mistakes. 

• Lcist of all, always have a remote emergency robot 
stop button handy, especially for large robots. 
Some robots can move very fast and weigh several 
hundred pounds, and may unexpectedly go out of 
control. 

It may seem that the task of transitioning a robot 
design from a simulator to the real world will be quite 
simple and straightforward. There are many issues 
that need to be considered. There will always be un- 
expected problems to deal with, but a user that is 
prepared and considers the issues mentioned above 
might ease the transition. 

Examples 

We recently worked on a couple of projects that 
involved the cooperative interaction of two heteroge- 
neous indoor mobile robots. One project, involving 
exploration and navigation of an office-like environ- 
ment, was implemented with some care and consid- 
eration to the issues described in this paper, but was 
so beset by problems that it failed to be completed 
within a hard time deadline and was never fully im- 
plemented. Another project, where the robots had to 
push boxes across an obstacle strewn floor, was im- 
plemented extremely quickly and serves as an excel- 
lent example of where failure to implement according 
to the suggestions above resulted in a very brittle, 
failure-prone, and frustrating multibot system but 
which, with some effort to resolve the problems us- 
ing the suggestions in this paper, can become much 
more robust and successful. 

Dynamic Duo 

One of the two robots involved in these projects is 
CARMEL, a mobile robot based upon a Cybermo- 
tion K2A mobile platform. BORIS, the other robot 
used in the projects, is based upon a TRC Labmate 
platform. These robots are shown in Figure 2. The 
major innate difference between the platforms lies pri- 
marily in the motion characteristics - CARMEL is 
essentially holonomic, being able to move in any di- 
rection at any time without first having to turn its 
body, while BORIS must first turn to face the direc- 
tion of travel before moving. Hardware differences 
other than those of the platform itself consisted of 
substantially different sonar rings (CARMEL has a 
circular ring of sonars, BORIS only has sonars facing 
in the forward direction), different CPUs (not only do 
they have IBM 80486 compatibles with differing char- 
acteristics, but CARMEL had an IBM XT compatible 
running the sonar system that BORIS does not have), 
and differing vision systems (CARMEL has a camera 
mounted on an independently rotating table, while 
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Figure 2: CARMEL (left) and BORIS (right), 
robots used in several multibot projects. 


BORIS’S camera is fixed to BORIS’S top facing di- 
rectly ahead of BORIS.) Low level functionality, was 
similar, but the robots had different implementations 
of some basic functions, such as obstacle avoidance. 

Exploration 

One of the events in the National Conference of 
Artificial Intelligence’s ’93 robot competition was to 
autonomously explore an “office” environment look- 
ing for a visually tagged object, which was then to be 
delivered to a predetermined room in the office com- 
plex. Walls in the “office” were three foot high panels 
of sonar-reflecting plastic and arranged to form rooms 
and halls. Each entry’s robot was placed within the 
office without knowing where within the office it was 
or in which direction it faced. However, the robot(s) 
were told in which quadrant of the office environment 
they started. And each robot Wcis given mostly ac- 
curate knowledge of the office layout with respect to 
wall placement and metric measurements; the actual 
office layout could differ from the map in that some 
doors could appear where not indicated on the map, 
and some doors on the map could disappear. 

While most entries were single robot approaches, 
which we also tried with CARMEL, we attempted a 
multibot approach (both BORIS and CARMEL) as 
we saw a distinct advantage in exploring the office 
in parallel with two cooperating robots. However, 
while CARMEL was already a fully autonomous mo- 
bile robot, with obstacle avoidance and vision systems 
and a great deal of experience in research and robot 
competitions,"^ BORIS was initially only a bare TRC 
Labmate robot base upon which a small amount of 
obstacle avoidance research had been performed. We 
realized from the onset that we should attempt to de- 
sign BORIS to be as functionally similar to CARMEL 
as possible. 

A great number of engineering changes had to be 
made both in hardware and software to accommodate 


the many differences in the two robots. An exam- 
ple is the feature detection algorithms used to detect 
the various configurations of “office” halls and open- 
ings. The same basic sonar-based obstacle avoidance 
system was pre-existent in the robots before devel- 
opment started, so that including algorithms to per- 
form feature detection seemed straightforward. The 
two robots had very different implementations of the 
sonar system, however, which made porting of the al- 
gorithms developed on one robot to the other robot 
extremely time consuming and is perhaps the single 
most important factor in the difficulty we had im- 
plementing the multibot approach. In addition, the 
sonar hardware differences (complete vs. partial rings 
of sonars) require BORIS to rotate in certain situ- 
ations when CARMEL does not have to, requiring 
modifications to some low-level functions as well as 
the planner. Parameters in the sonar system also had 
to be fine-tuned to each robot to account for differ- 
ences in the robot’s size, sonar filtering system imple- 
mentation, and other factors, and we were never able 
to perfectly match the results of the two robots.^ 

Box pushing 

The box pushing task at the AAAI ’93 robot com- 
petition involved locating boxes (marked with distinc- 
tive visual tags) and moving them into a predefined 
pattern while avoiding obstacles (boxes that were not 
to be moved). Because a robot pushing a box cannot 
“see” with its sonars and therefore cannot perform 
obstacle avoidance, this meant that one robot either 
scout out a clear path beforehand and then go back 
and get the box, or that two robots help each other, 
with one robot acting as the navigator and one robot 
pushing the boxes. We chose to implement the multi- 
bot design, which made the design more challenging, 
difficult and, as we found out, frustrating. 

Our approach to the task was to use CARMEL 
as the “Boss”, or navigator, and BORIS as the 
“Worker”, or box pusher. This task assignment was 
made because BORIS is built upon a square base, 
making it more conducive to pushing boxes than 
CARMEL, which has a cylindrical base. It was im- 
portant that each robot have some shared knowledge 
of the world to properly navigate around the arena; 
for CARMEL to act as the navigator and tell BORIS 
where to push boxes, it was necessary that each robot 
initially knew where the other one W 2 ls located in a 
global coordinate system. Each robot also had an in- 
ternal map of the arena indicating the boundaries and 
the single interior wall. Obstacle and object locations 
were not known beforehand. 

The interaction of the robots in this task Wets de- 
fined as follows: 

The robots first job was to synchronize themselves 
using a sequence of handshaking messages. Once syn- 
chronized, BORIS would drive to one of a number 

^Further ramifications of the difficulty experienced 
with implementing the multibot approach was that devel- 
opment of the single robot technology was slowed a great 
deal due to the “thinned” person-power of trying to bring 
two robots up to competition speed instead of only one. 
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of predetermined viewing positions in the arena and 
look for boxes. If it found one (or more) it would ap- 
proach it, stop just before the box, and communicate 
its location and orientation to CARMEL. Meanwhile, 
CARMEL would be waiting at its initial location until 
it received this message. CARMEL would travel to a 
location in line with the goal point where the box was 
to be dropped off and a few meters in front of BORIS’S 
position. CARMEL would then travel in three me- 
ter segments to the goal point while avoiding obsta- 
cles. At each of these via-points, CARMEL would 
send its current position to BORIS as an obstacle- 
safe position. BORIS would attempt to move between 
these points in such a way as to keep the box securely 
in front of it (using relatively slow, wide turns) and 
would “follow the leader” to the point that the box 
would be placed. Because CARMEL moved in such 
small increments, the hope was that BORIS’S path 
between via-points would keep it away from obsta- 
cles. Once CARMEL reached the box dropoff loca- 
tion, it would then move a safe distance away and 
wait for BORIS to find another box. BORIS would 
continue to push the box until the final location was 
reached. Free of the hindrance of the box, BORIS 
could then use its own obstacle avoidance system to 
move to the next viewing position and search for the 
next box. 

As expected all did not work out the way it was 
planned. There were some problems encountered, 
some with BORIS, some with CARMEL, and some 
with the cooperative aspects of the task. BORIS’S 
problems began with its vision system. On the initial 
attempt, the camera BORIS was using to locate boxes 
was pointed too low. As it turned out, the boxes we 
used for testing the robots were upside-down com- 
pared to the ones actually used in the arena, causing 
the tags on the boxes to be higher than the cam- 
era could see. BORIS moved from box to box, not 
recognizing anything, while CARMEL sat motionless 
waiting for a message from BORIS. After approxi- 
mately five visual scans, BORIS suffered an unex- 
plained lockup and we had to restart. We fixed the 
camera angle on BORIS and tried again. 

On the second run, BORIS located the box cor- 
rectly, approached it and transmitted a message 
to CARMEL communicating he was ready for the 
‘bosses’ orders. CARMEL’s map had been initial- 
ized improperly, so CARMEL thought that it was on 
the opposite side of the arena. It moved to the cor- 
rect location in its own map where it thought BORIS 
was, not to where BORIS actually was, and plotted 
out a path to the goal position. The map error re- 
sulted in a large discrepancy in positions, however, 
and the resulting path was useless. Unaware of this, 
CARMEL then sent the command to BORIS telling 
it to proceed. CARMEL proceeded to move along 
its planned path toward the phantom goal destina- 
tion, sending position messages that were uncorre- 
lated with BORIS’S map. BORIS, also unaware of 
the problem, started to execute the navigation path 
transmitted from CARMEL, but started to drive in 
the wrong direction and we had to start over again. 


On the third and final run, the situation improved 
slightly. The robots synchronized, BORIS immedi- 
ately found a box, CARMEL acknowledged the com- 
munications, moved in front of BORIS and the box, 
and traversed an obstacle-free path for BORIS to 
travel. CARMEL moved a safe distance away from 
the box drop off position and waited for BORIS to 
complete its pushing. However, at some point in 
the transmission of the initial path via-point from 
CARMEL to BORIS the message was corrupted and 
values for parameters were radically in error. This 
caused BORIS’S control program to crash and hang, 
leaving both robots hanging. 

While implementing the design for this project we 
ran into a number of issues discussed in this paper. 
Because we were able to actually implement and per- 
form this task, we encountered both issues like those 
encountered during the exploration task as well as 
run-time and coordination issues that we did not have 
an opportunity to discover in the abortive attempt 
at the exploration task. Communication related is- 
sues figured quite prominently among those we ran 
into. Both during development and actual competi- 
tion runs we experienced delays, losses, and corrup- 
tion of messages sent between the robots. Because of 
the short time period in which we implemented our 
design (approximately 24 hours), we were unable to 
build in many of the safeguards recommended above. 
We successfully synchronized the robots using a set 
sequence of messages between the robots. We did 
not, for example, implement any form of semantic 
contents checking to detect invalid messages. Nor did 
we implement high-level retransmissions of messages 
if an expected response from the other robot never 
arrived. Heterogeneity in hardware did not surface 
as a significant issue for this task, primarily because 
we used the robot’s heterogeneity to best advantage, 
we were not trying to achieve equivalent functional- 
ity, and the robots were to work have different task 
responsibilities. Software was more of an issue for the 
exact same reasons; we had to develop entirely dif- 
ferent control software for each robot and could not 
develop software on one robot and port it to the other. 

Future Work 

The examples above give vivid accounts of the 
problems likely to be faced by researchers working 
with multiple real robots. We continue to be inter- 
ested in multibot systems and applications, despite 
the extra effort that such work entails. The next 
major project is to develop a team of cooperating 
outdoor robotic vehicles (new military jeeps called 
HMWMMV’s, or “Hummers”) that can perform a 
task such as forward reconnaissance. We have de- 
signed and built one prototype vehicle (MAVERIC) 
based upon an electric utility cart, with which we can 
do development and experimentation without requir- 
ing access to the large and costly Hummers. When 
building another vehicle - to develop the multiple 
robot coordination technology necessary for this type 
of task - we will pay great attention to the issues 
raised in this paper. First and foremost, we will 
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try to make the two vehicles as identical as possi- 
ble. This will most likely entail outfitting the second 
vehicle in exactly the same manner as MAVERIC in 
many aspects. It will also probably require upgrad- 
ing MAVERIC with improvements, based upon our 
experience with MAVERIC and discovery of short- 
coming in its design or implementation, that will be 
implemented from the onset on the new vehicle. 

We would also like to explore using minimal re- 
sources on multiple minirobots to perform coopera- 
tive tasks. The robots that we are using are based 
on the MIT miniboard and use an MC6811 micro- 
controller. While computing power and controlling 
software are the same for each of the robots, there 
may be some small variations in the bctses and sen- 
sors. The general idea is to use the ideas we have 
discussed here to help design robust multiple robots 
and apply them in the real world. 

Conclusions 

A great deal of research hets been performed re- 
garding how we can get multiple agents cooperating 
together in wondrous harmony. Much of this research 
has not, as yet, involved working with the agents that 
will actually end up doing the work, robots. A re- 
searcher who has not already attempted this technol- 
ogy transfer is in for a lot of headaches unless he or she 
has some insight to the issues that are associated with 
such a process.. We have introduced and discussed 
the most significant issues and their causes, and have 
given many suggestions on how to deal successfully 
with them. A great deal of work and aggravation can 
be avoided by paying attention to these issues before 
implementation of a system on real robots. We have 
described the problems and solution faced when im- 
plementing two multibot projects in particular in or- 
der to fully illustrate what might occur during imple- 
mentation, and accentuate the importance of paying 
heed to the issues raised in this paper. 
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Abstract 

The application of telerobotics to aircraft de- 
pot maintenance and remanufacturing is described 
and a telerobotics architecture for the application 
is discussed. Telerobotics will enhance process 
quality and could potentially decrease turn-around 
time and costs while moving human operators from 
hazardous work areas to safe and comfortable op- 
erator control stations. The approach is to aug- 
mentj not replaccy the human operator by blending 
human skills with robotics capabilities. Configura- 
tions of the architecture for telecranCy mobile car- 
rier, and gantry applications are shown. 

1> Introduction 

This paper summarizes a study performed by 
the Jet Propulsion Laboratory for the Air Force 
Material Command (AFMC) Robotics and Au- 
tomation Center of Excellence (RACE) to eval- 
uate the feasibility of telerobotic solutions to C- 
5A heavy lifter aircraft maintenance processes and 
develop a telerobotics architecture for the appli- 
cation [1]. The architecture was developed for 
general depot maintenance and remanufacturing 
applications and applied to the C-5 A application. 
Several implementation options suitable for inser- 
tion into a variety of depot applications that sup- 
port the C-5A heavy airlifter are described. 


The Aircraft Directorate at the San Anto- 
nio Air Logistics Center (SA-ALC) is responsi- 
ble for depot level maintenance on the C-5 air- 
frame. The efficiency and productivity of many 
of the required repair processes will benefit from 
the insertion of telerobotics technologies. Small 
batch size, feature uncertainty, and varying work- 
loads make hard automation impractical for a wide 
range of depot processes. Systems are needed that 
can bridge the gap between manual control and 
complete automation. Supervised autonomy and 
shared control technologies provide intermediate 
solutions where the human and machine collabo- 
rate to perform tasks. In supervised autonomy, 
robotic tasks are interactively developed by the 
operator and then executed autonomously [2]. In 
shared control, control inputs during task execu- 
tion are provided both by an operator, e.g., using a 
hand controller, and an autonomous system [3]. A 
more complete description of telerobotics systems 
can be found in [4]. The goal is to augment, not 
replace, the human operator by blending human 
skills with robotics capabibties. 

Aircraft depot maintenance and remanufac- 
turing provides a wide range of challenges for 
robotics. The physical scale of the appbcations 
includes stripping paint from a C-5 heavy lifter to 
remanufacturing small individual parts. The parts 
generally arrive individually or in small batches 
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and a wide variety of parts are remanufactured. 
Due to the wide variety and scale of the appli- 
cations, the maintenance and remanufacturing is 
now done almost exclusively manually. Example 
depot applications which the architecture must ap- 
ply to include: painting of the C-5A exterior in 
a dedicated hanger facility; painting of removed 
piece parts in a robotic workcell; stripping of paint 
from the C-5A exterior in a dedicated hanger facil- 
ity; surface finishing in form of removing material 
from patches and polishing metal to a high gloss 
finish in a robotic workcell; Surface cleaning of re- 
moved parts in a robotic workceU through appli- 
cation of bicarbonate of soda particulate stream; 
and sealing and desealing of aircraft fuel tanks. 

It is expected that telerobotics can provide 
many benefits to aircraft depot maintenance and 
remanufacture. Limited manpower resources limit 
the number of aircraft that can be remanufac- 
tured. Telerobotics can augment the productiv- 
ity of operators allowing a greater rate of aircraft 
throughput. In many instances telerobotics can 
provide better process control, e.g., paint can po- 
tentially be sprayed on an aircraft more uniformly 
than by an operator leading to reduced average 
thickness and cost savings in paint and aircraft 
weight. There are various hazardous work situa- 
tions and environments in aircraft depot mainte- 
nance and remanufacturing areas including: chem- 
ical contaminants in the air and on shop surfaces; 
handling large, bulky support equipment; exces- 
sive vibration, especially of hand-operated equip- 
ment; and excessive atmospheric heat and humid- 
ity (up to 100 deg.F, 95% humidity). Telerobotics 
allows placing a manipulator in the hazardous en- 
vironment and moving the operator to a safe and 
comfortable operator control station. Addition- 
ally, there are tedious applications which cause fa- 
tigue and subsequent errors, e.g., paint stripping 
and deriveting. Many of these tasks can be accom- 
plished with the operator supervising a telerobotic 
system to perform the task resulting in greater ef- 
ficiency and quality. 

Since the telerobotic architecture was de- 
signed for use across a wide variety of depot air- 


craft and m^aintenance and remanufactiiring appli- 
cations, there are a large number of requirements 
it must satisfy. The architecture must accommo- 
date different types of robotic manipulators with 
varying degrees of freedom with modular changes 
only to interface code. It must accommodate dif- 
ferent types of transport and positioning devices 
for robots and piece parts with modular changes 
only to interface code. Initialization and monitor- 
ing must be automated and rapid. Human oper- 
ations shall be able to safely operate within the 
range of motion of most manipulating and posi- 
tioning devices through built-in safety protocols 
(hardware, software, and/or procedural) Smooth 
transitions to manual workaround modes must be 
possible during automation downtimes for main- 
tenance, upgrades, etc. The architecture must 
accommodate different, unmodeled parts in all 
piece part applications. Software and hardware 
upgrades shall cause minimal down time. 

2. Example Application: C-5A Aircraft 
Maintenance and Remanufacturing 

The remanufacturing processes that support 
depot level maintenance of C-5 aircraft are repre- 
sentative of a wide variety of dual-use applications. 
Applications include stripping the external surface 
paint and then repainting, painting removed parts 
in a robotic workcell, skin repair, surface cleaning 
of removed parts through application of bicarbon- 
ate of soda particulate stream, surface finishing for 
patches, and polishing metal surfaces. A unique 
aspect of working on large airframes (the C-5A is 
over 247 ft. long and with a wingspan over 222 ft) 
is the requirement for large positioning systems. 
Several alternatives are possible. The first option 
is the telecrane concept where a special facility 
provides telecranes upon which the manipulators 
are mounted, as shown in figure 1. Such a tele- 
crane facility is presently used at Kelly AFB which 
positions human operators around the aircraft for 
servicing applications (paint stripping with plas- 
tic beads). The telecranes do not have positioning 
sensors so either positioning sensors would have 
to be added, or some other method would have to 
be used to determine the position of a manipula- 
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tor mounted on a telecrane. A second approach 
is to use mobile carriers where manipulators are 
mounted on mobile bases and the mobile bases 
are capable of being positioned around the air- 
craft, cts shown in figure 2. Another option is to 
use an overhead gantry system where manipula- 
tors are mounted on mobile gantries. These trans- 
port methods apply to tasks which are done on 
the physical aircraft structure, such as painting, 
but there are also many tasks which are done on 
piece parts in separate workcell rooms such as re- 
pair and painting. 

3. Telerobot Architecture 

A telerobot architecture was developed to 
provide a near-term solution for implementation 
of a telerobotics system for C-5A servicing. Var- 
ious architectures were evaluated such as the 
DOE GIST architecture [5], the NIST NASREM 
architecture[6], and the NASA/JPL local-remote 
architecture[2]. The architecture developed here 
has ideas common to aU of these architectures. 
The CISC architecture provides the important 
concept of intelligent subsystems. The NASREM 
architecture provides valuable contributions in the 
coordination of task decomposition, modeling, and 
sensing. The NASA/JPL architecture provides 
the valuable concept of independent data-driven 
software modules to collectively provide general 
task execution capability. 

The architecture developed for aircraft main- 
tenance and remanufacturing is shown in figure 3. 
The architecture is nominally separated into lo- 
cal and remote sites corresponding to the location 
of the operator and robotic systems, respectively. 
The actual computing resources can be physically 
located near the operator, robotic system, or sep- 
arate from either. The primary constraint is that 
sufficient communication bandwidth is provided. 
The basic concept of the near-term system is that 
there exist subsystems which have sufficient inher- 
ent capability to execute a wide range of task types 
either independently or in coordination with other 
subsystems. A task program is generated by the 
local site which describes the task to be executed 


either by an independent subsystem or through 
coordination of subsystems. The task program 
can be executed in various ways depending on the 
level of capability of the coordinating subsystems. 
The desired solution is to allow distributed au- 
tonomous control of the coordinating subsystems 
by separating the task program into subsystem 
task programs. The subsystem task programs can 
then be executed by a task program sequencer, 
possibly at the local site operator control station, 
or sent to the subsystem controller for execution 
within the subsystem controller, if possible. Sub- 
system inherent capabilities are programmed off- 
line so that during task setup and execution the 
subsystems already have the necessary inherent 
task execution capabilities. 

Various maintenance and remanufacturing 
scenarios provide a poorly structured environment 
so that sensing the environment is necessary to 
generate or update a model of the environment. 
For example, neither the manipulators nor the air- 
craft wiU be positioned accurately to a well known 
location a priori to task execution. Before, or dur- 
ing, task execution, the relative positions of the 
manipulator and aircraft area of interest must be 
determined. A main object knowledgebase is pro- 
vided which stores global state information. Each 
subsystem also has its own database which in- 
cludes relevant information from the object knowl- 
edgebase and information generated from sensing 
the environment during task execution. The ob- 
ject knowledgebase and subsystem database are 
kept consistent for common information. Envi- 
ronment modeling can be done in various ways. 
Autonomous subsystem tasks can include, or have 
primarily, modeling elements. Alternatively, the 
operator can interact with the system to aid in 
developing models of the environment. For appli- 
cations which require highly accurate positioning, 
such as deriveting, it is likely that either sensor 
based position servoing or shared control will be 
necessary. An a priori generated model of the rivet 
pattern is unlikely to have the accuracy relative 
to the real rivet pattern that would be necessary 
for rivet removal. Sensor based position servoing 
would likely utilize real-time vision with an arm- 
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Figure 3: Telerobot architecture for aircraft maintenance and remanufacturing 




















mounted camera. A proximity sensor and possibly 
a sensor to measure surface tilt might also be used 
concurrently to control the position of the manip- 
ulator relative to the target. Rivets are difficult to 
find autonomously since the rivets have approxi- 
mately the same color a.s the skin. Also, for pre- 
viously repaired skin sections, the rivet pattern 
may not be known a priori. Therefore, for rivet 
removal, the operator can facilitate the use of the 
automated vision and sensing system by designat- 
ing the rough location of the rivets to be removed. 
A video image of the skin section is provided on 
a monitor for the operator. If a model of the skin 
section is available, then it is overlayed on top of 
the video image (a ghosted image or perhaps wire 
frame). Otherwise an approximate model of the 
skin is generated to provide a three dimensional 
surface upon which to designate rivet locations. 
The operator then utilizes an input device to move 
a cursor to the rivet locations seen on the video im- 
age and selects the rivets to store their locations 
in the object knowledgebase. These approximate 
locations can then be used as starting locations for 
the automatic sensor based localization later. It is 
often useful in the task programs to specify ob- 
jects and locations symbolically rather than with 
absolute locations. Then the task program can be 
generated independently of the actual locations. 
The actual locations of objects can be generated 
later either independently and stored in the ob- 
ject knowledgebase or as part of the task where 
operator input is automatically requested. Shared 
control can also be used to specify destinations. 
Here the operator uses an input device such as a 
six DOF hand controller to position the manipu- 
lator above the rivet. The proximity and/or tilt 
servoing could be occurring simultaneously to con- 
trol the distance to the surface, depending on the 
method for removing the rivets. In this case the 
operator replaces the vision system. 

It is desired that task description be as sim- 
ple as possible for the operator. Therefore, as 
much intelligence as possible is designed and pro- 
grammed into the system. For a sophisticated 
implementation, the operator would provide high 
level goal based information and the system would 


autonomously generate the associated task pro- 
grams. A more realistic near-term system would 
require greater interaction with the operator to 
develop a new task. It is desired that the opera- 
tor interact with the system primarily within the 
video/graphical environment, i.e., in a telepres- 
ence sense, both for task description and task ex- 
ecution. For task description, the operator would 
move the graphical manipulators via an input de- 
vice such as a six DOF hand controller. The ob- 
jects to interact with could be selected directly, or 
implied by proximity or context. The tool which 
the manipulator is carrying, along with the pre- 
vious task steps and the selected object, provide 
a large amount of context information which the 
system could use to automatically suggest to the 
operator, or select, the next action to take[2]. The 
actions could be the subtask segments from the 
task knowledgebase. 

The remote site subsystems will vary in the 
types of systems which they will control, in capa- 
bility, and in vendor source. For some subsystems 
the task program will have to be translated into its 
command language. For other subsystems, a task 
program might be used directly. There are several 
types of control and coordination which may be 
needed within subsystem control and between sub- 
systems. Closed loop control implies that there is 
a close coupling between sensory data and control 
commands to the devices. One subsystem provides 
cooperative control of its associated devices. Mul- 
tiple subsystems can be coordinated to achieve a 
task goal. 

Configurations of the architecture shown in 
figure 3 for telecrane, mobile carrier, and gantry 
applications are shown in figures 4- 6. 

4. Evolution of the Telerobotic 
Architecture 

The architecture shown in figure 3 supports 
near-term system development and evolutionary 
growth. Most of its basic features can be provided 
by existing vendors of automation and robotics 
technology. One drawback of current technology 
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Figure 4: Focused Telecrane Control Architecture 
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Figure 5: Focused Mobile Carrier Control Architecture 
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Figure 6: Focused Gantry Control Architecture 

























is that it is difficult to integrate systems from dif- 
ferent vendors when a significant amount of con- 
trol and modeling information is passed between 
layers in the architecture, since this information 
is often stored in different formats. The evolu- 
tionary direction of the architecture is to provide 
subsystems with increasing levels of intelligence 
which can be provided with goal based informa- 
tion rather than control based information which 
is prevalent with current technology. The intel- 
ligent subsystems would then autonomously re- 
quest resources from other parts of the system such 
as the object knowledgebase. The resulting task 
programs would then be significantly smaller and 
quicker to generate. Protocols for communicating 
requests and information between the subsystems 
need to be developed. This approach is consistent 
with the goals of the Next Generation Controller 
program [7] which is developing a similar architec- 
ture for machine tool control. In the next year this 
effort wiU work more closely with the NGC effort 
to attempt to develop common interfaces and a 
common evolutionary architecture. The operator 
remains an integral part of the evolutionary in- 
telligent architecture. In such an architecture the 
operator could become one subsystem with mul- 
tiple capabilities or could be modeled as multiple 
subsystems. Also, the operator could act as one 
part of one of the subsystems such as the c^se de- 
scribed above where the operator performed the 
visual servoing for rivet localization. The system 
would then request input from the operator for 
information it cannot generate automatically, just 
as it would query one of the other parts of the 
system. 

5. Conclusions 

Application of telerobotics to aircraft depot 
maintenance and remanufacturing was discussed. 
The requirement to reduce technology insertion 
and system life cycle costs mandated the design of 
a generic architecture which can be implemented 
in the near-term and and still provide an evolu- 
tionary growth path. Most of the basic features 
of the near-term architecture are available from 
existing vendors. The evolutionary architecture 


utilizes increasing intelligence in the various mod- 
ules of the system resulting in a more distributed 
autonomous control system. A commercialization 
study is underway. 
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Ab^act 

This paper presents a new method of automatic 
path planning and trajectory generation for robotic 
surface finishing. Initial development is on a 
platform integrated from commercially available 
components including AutoCAD and a low cost 
CMM. Automatic program execution is 
demonstrated using a 6 DOF manipulator on a 
variety of complex shaped surfaces. 

Introduction 

The focus of research described in this paper 
was determined by the needs of end-users associated 
with The Automated Surface Finishing Consortium. 
The mission of the Consortium is to develop 
finishing automation as an end-user technology for 
U.S. manufacturers. Members are military and 
commercial manufacturers, universities, federal labs, 
and commercial suppliers of abrasive process 
technology and robotics technology. 

An objective measure of technology 
development/deployment impact is the availability of 
successful commercial off-the-shelf technology. The 
synergistic interactions of the Consortium members 
have yielded impressive results to date. Force 
controlled robot end effectors commercialized by 
collaborating members have significantly broadened 
the boundaries of available compliant finishing 
process capability. A generic off-the-shelf integrated 
robotic finishing system, the product of another 
collaboration, is now distributed and supported by 
the largest robot company in the world. Likewise a 
low cost high performance simulator based surface 
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path programming aid is now a commercial product. 
ARRI is host to the Consortium, and a member. 

Background 

Most material forming processes do not 
produce finished parts. Most machining operations 
leave burrs and sharp edges. Large aircraft wing 
skins are milled by three axis terrace cutting leaving 
small steps which must be blended to prevent 
fatiguing stress concentrations. Complex curved 
surfaces like ship propellers and aircraft landing 
gear are machined with rounded milling tools which 
leave a pattern of tool marks which must be ground 
off by hand. Cast parts require gate and sprue 
removal and deflashing. When castings are 
machined they require deburring. Many parts must 
have their surfaces conditioned for appearance or 
subsequent plating and coating operations. The 
stamping and forging of automobile door panels and 
engine components leave "imperfections" which are 
finished out by hand. Die cast surfaces of hardware 
for door handles and hinges are ground and 
polished. Wrenches, golf clubs, vacuum cleaners, 
furniture, boat hulls, all have to be ground, sanded 
and polished to have a nice appearance. 

The costs of manual finishing are high. A 
recent report by the National Center for 
Manufacturing Sciences suggests that direct 
finishing costs exceed 25 billion dollars annually 
based on mechanical manufacturing industry gross 
sales of 1,000 billion dollars^ . Often in precision 
parts manufacturing and particularly with complex 
shaped parts there is high finishing labor content. 

Jet engine component manufacturers in the U.S. and 
Canada report that between 15% and 30% of the 
direct cost content of their products is finishing 
labor. A major consequence of manual part 
finishing is scrap and rework. Although difficult to 
quantify, interviews of finishing workers in high 
value added manufacturing invariably reveal costly 
scrap and/or complicated expensive rework 
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procedures. This is a direct result of manual 
finishing errors. 

Most of the finishing laborers are grinding, 
filing, sanding and scraping parts by hand. Some 
finishing jobs are high paying, requiring artisanship 
based on years of practice. Most finishing jobs are 
low skill low quality jobs, typically they are the least 
desirable and lowest paying. The reason is that 
manual finishing work is drudgery of the worst kind, 
exposing workers hands and arms to mechanical 
impact and vibration while requiring repetitive 
motion, and often producing dangerous and toxic 
dusts. Fortunately, turnover rates are relatively high 
in the lower paying jobs which helps to limit 
prolonged exposure. There is a great need for 
practical hands-off finishing process capability. 

The application of abrasive processes to meet 
finishing needs is diverse and widespread in 
manufacturing. The abrasive industry, from mining 
through tool design, manufacture, and distribution, 
is a multi-billion dollar per year industry. The 
number of permutations of tool size, shape, abrasive 
type, and other characteristics of manual abrasive 
tools is astronomical. Finishing processes are 
typically performed manually, the process is planned 
based on knowledge and experience, and controlled 
by the operator with his senses of sight, touch, 
hearing and even smell. Desired output is often not 
measured but judged. Automatically controlling 
finishing processes in the same way as human 
operators control them would be difficult to achieve 
in practice. Other methods of automation do work. 

Scores of robots are now deployed in U.S. 
factories finishing a wide variety of parts^. There 
are many factors which have contributed to tlie 
growing commercial success of robotic finishing. 

The dominant success factor is the pioneering effort 
by 3M and others to develop the compliant abrasive 
finishing processes. The key is compliance between 
part and tool, critical because it reduces variation of 
tool force, a dominant process control parameter. 
Even slight variations on the order of a few 
thousandths of an inch in the relative location of the 
robot's commanded tool point path and the actual 
contact point will result in large changes in tool 
force reactions if the system is too rigid. Normally 
occurring tool wear, part-to-part geometric variation, 
manipulator kinematic error, and set-up errors drive 
rigid abrasive processes out of control. There is a 
wide variety of abrasive tools available which are by 
the nature of their material composition, naturally 


compliant. Also, several constant force devices are 
commercially available which provide added 
compliance in one direction, usually normal to the 
surface of the part. With constant force end 
effectors, more aggressive abrasive tools can be used 
with robots. 

The Need for Automatic Programming 

A major barrier to further finishing robot 
deployment is system programming. Generating the 
robot motion control sequence is a major problem in 
manufacturing operations which produce parts in a 
variety of complex shapes. One manifestation of the 
programming problem occurs when a variety of 
complex shape parts must be finished. Each part 
type requires only one program which is executed 
repeatedly for its specific part type, but the programs 
have a large number of taught points. An example is 
the polishing of large asymmetric shaped aircraft 
skin panels. One manufacturer determined that 
manual teaching of robotic polishing programs was 
to too costly when their part mix approached eight to 
ten part types. 

In cases requiring a unique motion sequence to 
finish each individual part, manual methods must be 
used because of the lack of practical programming 
methods. For example in aircraft remanufacturing, 
aerodynamic surfaces dented in normal service are 
routinely repaired by applying filler materials which 
are ground to shape by hand. Since the location and 
extent of this type of damage is random, each part 
requires a unique grinding program. Unique plans 
are commonly required for finishing turbine blades 
and propellers. The location and amount of excess 
material left by machining and forging is not entirely 
predictable. An automatic programming system 
which can be integrated with commercial robotic 
surface finishing equipment is needed. 

Automatic finishing with compliant abrasives 
does not require automatic process control and 
planning. Planning is uncomplicated for many 
applications because process settings such as 
abrasive type, tool cant angle, tool force, tool feed, 
and tool speed remain constant for most or all of the 
individual job. Path planning and the subsequent 
trajectory generation still require too much manual 
effort for many robotic applications. 

The approach taken here to developing 
automatic programming for robotic finishing is to 
first automate the path and trajectory generation 
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while depending on the human operator for higher 
level process planning, supervision and control 
Interviews with end user companies revealed that 
using CAD design data as a basis for modeling 
surfaces for path generation purposes would be 
impractical. So reverse engineering the surface to 
create a geometric model was included as a system 
requirement. For reasons of maximizing practical 
application success, commercial products are used to 
the greatest extent possible. Furthermore, overall 
system cost is minimized so that smaller companies 
will more readily be able to acquire the technology. 
This approach serves the objective of providing 
valuable off-the-shelf programming automation 
while providing a basis for the development of 
intelligent and more fully automatic processing. 

System Equipment 

A personal computer(PC 486-50mhz) and Faro 
Metrecom Coordinate Measuring Machine(CMM) 
are used with MS-DOS and AutoCAD as the 
platform for the automatic path and trajectory 
programming system. The Faro Metrecom CMM is 
a 6 DOF spatial digitizer interfaced to AutoCAD. 
AutoCAD's 3D modelling capability allows creation 
of, access to, and modification of geometric objects 
through AutoLISP and C. Customization of 
AutoCAD's environment through menu building 
using AutoLISP and Macros makes the objects 
accessible and easy to manipulate. The Fanuc model 
S-700 is a 6 DOF serial link manipulator with multi- 
tasking capabilities using the R-J controller. The R- 
J controller is off-line programmable. The Karel 
language programs can be written and compiled on a 
PC workstation. The R-J controller interfaces to a 
PC from which compiled robot control code can be 
accessed. The source code is written in ASCII 
format in Karel, then it is formatted and translated to 
executable code using the Karel translator program 
supplied by Fanuc. A server supplied by Fanuc is 
used to download the translated code to the 
controller. The PushCorp constant force end effector 
which was developed as a compliant abrasive end 
effector for surface finishing has a dedicated 
controller with interface for the R-J controller. The 
end effector mounts directly onto the faceplate of the 
robot with the axis of rotation of the grinding disk 
parallel to that of the faceplate. 


Seiup-Erocedure 

Typically surface models for off-line 
programming applications are generated using 
consmjctive solid geometry (CSG). In this case the 
geometry will be recreated in a CAD environment 
from digitized surface location data in a process 
called "reverse engineering". Before digitizing the 
surface a common coordinate system is established 
between the CMM and the robot. 

The robot can relate a tool frame to a base 
frame and provides several programmable frames of 
both types as well as one predefined frame of each 
type. Since it is the location and orientation of the 
grinding disk that is of importance a tool frame is 
taught at the center of the disk with the CFD 
positioned at the midpoint of its compliance stroke. 
There are different methods of teaching a tool frame 
to this robot through the teach pendent menu, and 
the method used here is the 3 point method. The 
desired tool point is placed at some fixed point in 
space which is taught three times with three 
completely different end effector orientations. This 
point on the tool can then be moved in space to any 
point within the robot's workspace relative to the 
base coordinate system or a user defined coordinate 
system. 

Next, a user defined coordinate system is 
defined using a 3 point method for the robot and the 
CMM simultaneously. First a new origin is taught to 
the robot, and then the same origin is taught to the 
CMM by placing its tool point at the tool point of the 
robot and recording it. The robot is then moved 
parallel to the X axis of the base coordinate system 
several inches away and taught a point on the new X 
axis which is followed by teaching the CMM the 
same point to define it’s X axis. Finally a point is 
taught to both devices on the positive part of the XY 
plane. The CMM automatically translates to this 
new system, and the robot can be instructed to do so 
through the teach pendent or through off-line 
programming tools. 

This entire process can be done in 10 minutes 
and is only necessary for initialization or when the 
set-up is disturbed. Once the two devices share a 
common coordinate system tlie Faro CMM can be 
used to generate a CAD model of the surface to be 
finished, and die coordinates on the surface will be 
the same with respect to the CMM or the Fanuc 
Robot. 


642 


Surface Model 


The Faro/ AutoCAD interface allows the user to 
move the endpoint of the CMM digitizer through 
some trajectory in 3D space and generate a 
3dpolyline entity with complex entity nodes at a 
predetermined interval. By keeping the endpoint of 
the digitizer in contact with a contoured surface, a 
3dpolyline that lies on that surface is created. 

AutoCAD has the capability to generate a 
Coons Patch based on four bounding lines in space 
that are connected at their endpoints. The number of 
facets, or mesh segments are programmed using the 
variables surftabl and surftab2. The importance of 
this is that facets will be used to determine the path 
node frames of the end effector trajectory. Mesh 
density in the direction of travel will affect the 
smoothness and accuracy of the motion while mesh 
density in the direction perpendicular to the direction 
of travel controls the spacing between passes. 

An AutoLISP program is activated through the 
customized pull down menu at which time the Faro 
3dpolyline programs are executed and request the 
user to drag the digitizer over the desired boundary. 
This operator defined boundary encloses the region 
on the surface to be finished. The program then 
automatically performs the required modifications of 
the polylines to ensure that they are connected at 
their endpoints. The result is four 3D lines in space 
connected at their endpoints forming the bounding 
edges of the Coons Patch surface of m by n surface 
facets. Upon completion of generating the four 
boundary lines and assigning the entity data lists to 
variable names, an interpolated surface is 
automatically generated using the Edgesurf 
command. This command requires selection of the 
four existing bounding edges and is answered by the 
program with information from the stored entities. 
The direction of travel is requested upon completion 
of the mesh generation process and can be in the m 
or n directions. 

Extracting Tool Pose Information from 

the Surface Modd 

The problem of determining robot tool poses 
from surface data involves several steps. A brief 
outline is helpful in describing the work tliat has 
been automated. 


1. Create or access an exisiing surface 

A function is implemented to number the nodes 
of the mesh and add this integer data to the actual 
entity data which was created by AutoCAD. This is 
done to simplify the development of an algorithm to 
access individual nodes and their immediate 
neighbors on the surface. 

2. Determine a unit surface normal vector 

This is done by averaging several normal 
vectors in the region. Each node P being considered 
as a path node has four immediate neighbors 
including two in the direction of travel and one each 
on either side perpendicular to that direction. These 
will be referred to as Pj, P 2 , P 3 , and P 4 The cross 
product between vectors originating at P and 
terminating at two of the neighboring nodes yields 
an approximate normal vector to the surface at that 
point. An algorithm is implemented that performs 
this same operation for four quadrants of the 
irregular quadrilateral formed by these points with P 
as the origin for each vector which results in four 
approximate surface normal vectors at the node P. 
These four approximate surface normals are then 
averaged and normalized to get an approximate unit 
surface nonnal at the node P. This procedure is 
perfonned for each node on the surface excluding 
those on the bounding edges. 

3. Define an angle of attack 

When the program that calculates and draws 
the surface normals is complete, another function is 
automatically called that requests a cant angle^for 
the tool. This angle is the pitch angle of the tool that 
will be used for the finishing process. The user can 
type a number in degrees at the command line after 
which the information is used to calculate tool 
frames at each node of the path. 

The calculation of tool frames is based on the 
transformations found in Craig's Text^. The result is 
shown graphically on the screen in the form of a 
coordinate system at each node in the orientation 
that the end effector will be in at that node in order 
to maintain the desired cant angle with respect to the 
surface and the direction of travel. 

4. G ene rate the Kar el Pro g ra m 

At tliis point the tool frame information must 
be expressed in the language understood by the 
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robot. The user is prompted for a filename which 
will be used to store a Karel program in the form of 
fname.V\ and a data file in the form of friameLkl , 
Jhame2.k\ and so on for each branch. Another 
program then implements an algorithm developed to 
determine which nodes belong to a branch of the 
path, and builds the path as a list containing 
branches which them selves contain lists. A branch 
list is of the form (1 5 6 7 8) where the first number 
is the branch number and determines it's order of 
execution among branches, which is followed by the 
node numbers which define which nodes will be 
traveled to in this branch and their order. Each 
branch begins at a boundary and follows the mesh in 
the previously chosen direction until it reaches the 
opposite boundary and will include all nodes along 
the way that are not actually on the boundary. The 
initial and final nodes are extrapolated linearly based 
on the two first or last nodes respectively with a user 
defined offset in the direction away from the part in 
order to allow a smooth approach or departure to or 
from the surface. These extrapolated nodes will not 
show up in the path data list and are calculated just 
before being written to the data files 

The final path variable will look something 
like this : 


types of data formats in Karel including the type 
path. A path type variable is a structure which can 
contain several other specific types of data including 
position data in the form shown above. Several 
positions and orientations can be stored in a single 
path variable, and then used in a move_along 
statement which instructs the robot to move it’s end 
effector through those points sequentially. There are 
also motion control statements that can be used in 
conjunction with the move^along statement that 
affect speed and acceleration between nodes. 

A LISP program is then called that writes a 
Karel program to be translated and sent to the 
controller. The Karel program is then loaded into the 
controller and executed at which time it reads the 
data files and builds path variables in a routine 
called build_path. Each branch of the path is built 
before being used in motion statements that cause 
the robot to follow them. The move_along 
statements are executed sequentially and the tool 
moves aCTOSS the surface as planned. 

The actual grinding process can be initiated 
either through the Karel program by way of an 
option included in the custom menu, or by using the 
teach pendant. 


(pal (1 7 8 9 10 11) (2 18 17 16 15 14) (3 20 21 22 
23 24)) 

and will be used to write the path data files. The path 
data file contains only information that is relevant to 
the Robot controller and it's path data structure. 

Each branch of the data is written to a data file in a 
specific format that is shown in the following which 
was taken from an actual data file. 


Conclusions 

A practical method of automatically generating 
tool point paths and robot trajectories for surface 
finishing has been developed and implemented using 
off-the-shelf technology components. This method is 
expected to only increase average robotic finishing 
system costs on the order of 5%-10% of total cost. 
High level process planning and supervision remain 
under human operator control. 


536.6210 103.9143 
479.4422 114.2409 
422.2633 124.5675 
366.8345 136.8362 
312.7229 146.2052 
259.5330 158.6093 
206.3431 171.0134 


- 208.2082 175.8836 ! 
- 259.2189 175.8836 
- 259.4296 178.8781 
- 270.6763 - 177.4433 
- 288.9642 - 174.9756 
- 308.7990 - 173.1321 
- 277.8339 - 174.9756 


5.6680 - 9.1591 

3.6680 - 9.1591 
- 5.6877 - 11.3613 
- 14.8075 - 11.1896 
- 19.1880 - 11.4856 
- 20.1050 - 13.6317 
- 19.1880 - 11.4856 


More research is needed to develop practical 
process planning and control methods to supplement 
the operators process knowledge so that new 
applications requiring new processing recipes can be 
quickly implemented. Also sensor applications must 
be developed which augment the operators ability to 
assess surface condition and geometry. 


This data file represents branch 1 from the 
path list and each line in the file represents the data 
for X, Y, Z, and the Euler angles required to define 
position and orientation of the end effector for a 
particular point on the surface. There are several 


A longer range objective is to develop 
automatic finishing as a smart processes which can 
be driven using feature based CAD data from 
intelligent high level supervision and control 
systems. Automatic processing capability must be 
broadened beyond the compliant abrasive process 
methods used today. 
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Efforts to build the manufacturing information 
technology infrastructure to support intelligent agile 
production continue, but the automatic planning and 
control of most shop floor processes including 
finishing is not adequately understood. Furthermore, 
the application of sensor technology needed for 
automatic observation to support finishing process 
control is not understood. Much empirical work will 
be done to discover practical process planning, 
control, and sensing methods for automatic 
finishing. The automatic path generation capability 
developed through this research will immediately 
improve application development productivity. 
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ABSTRACT 

Traditional teleoperation depends solely on operator skill 
for effective task completion. This dependency limits the 
class of operations which were suitable for teleoperation. 

In many work situations today, external and operational 
requirements necessitate a more efficient operational 
scenario. The potential environments are more complex 
and dangerous and require integrated safety systems. One 
way to enable more effective control is to share control 
between the human operator and an intelligent computing 
system. This paper discusses the requirements and an 
implementation of a virtual environment for telerobotic 
shared control. 

Efficient, effective and safe system operation depends on 
the operator's ability to make wise decisions. In order to 
make wise decisions the operator needs a clear 
xmderstanding of the operational requirements, and an 
imderstanding of the external factors which affect the 
operation requirements. A virtual environment built upon 
a shared control interface can provide much of the 
information required for safe operation. The virtual world 
allows the operator to clearly visualize the task and 
provides feedback that is not otherwise available. The 
shared control interface verifies operational demands 
against system constraints. The virtual environment 
provides for a practice and training environment where 
mistakes can be made. Only after an operation has been 
cleared for execution does physical motion take place. 
Proper linking of human input with computer controlled 
heuristics greatly increases the safety level of the system. 

The knowledge base that the computing system uses to 
perform decision making is called the World Model. 
Visual information from the World Model is displayed 
graphically as rendered, shaded, texture mapped animated 
polygons. The core of the shared control system is the 
computational engine that maintains, displays, visualizes 
and controls interaction with the World Model. The 
computational engine can be interrogated at several 
different levels. During operation: multilevel interaction 
defined by the program (mouse, keyboard, multi-modal 
input devices). The visualization system offers a high 
interactive operating environment, enabling users to 


change objects, display attributes, and elements of the 
World Model in real time. 

High speed and optimal interaction with external input 
and teaching devices are required for effective interaction. 
Traditional teleoperation allows the user to effect 
immediate changes in manipulator configurations via an 
input device(s). Anything from a simple joystick to a force 
reflecting master have been utilized. The virtual 
environment uses these input tools and more advanced 
tools such as datagloves to direct user input into the 
graphics environment. This enables the use of the graphics 
environment for training, simulation, design interactions 
and will allow the World Model to interact and interpret 
operator commands. 

2. INTRODUCTION 

The integration of a graphical-based programming and 
control environment has the potential to enhance use and 
utility of robotic systems. Traditional off-line robotics 
programming methods address some of the needs for 
increased productivity of robotic systems. A more 
complete solution will address simplifying the 
programming process, direct linking of the graphics and 
control environment, the needs of cell calibration and 
model registration, and the needs of sensor- directed robot 
control. The graphic model part of the solution allows for 
extensive planning and conceptualization to occur without 
the use or need for actual equipment. The direct control 
side of the system allows for the seamless communication 
of commands and information from the graphical model to 
the device controller, and from the device controller and 
system sensors back to the graphics model. The 
sensor-based control requirements and the inclusion of 
calibration methodologies allow for confidence that the 
modeled world corresponds to the actual environment. The 
research described here represents our efforts to design 
and construct complete tools for enhancing robot or 
intelligent device planning, programming, and control 
processing. 


Copyright © 1993 by Deneb Robotics. Published by the American 
Institute of Aeronautics and Astronautics, Inc. with Permission. 
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3. HAZARDOUS ENVIRONMENTS: A 
TECHNOLOGY DEVELOPMENT FOCUS 

An application focus is an important element of 
technology development. An application focus not only 
provides an understanding of which technology areas 
should be addressed first, but frequently suggests which 
technology approaches will be the most fruitful. For 
example, the highly structured nature of most 
manufacturing environments fostered the development of 
pick-and-place robots and strongly suggests that fixtures, 
not real-time sensing, is the most appropriate way to locate 
objects in the robot’s work space. Teaching, in which the 
operator manually moves the robot to a location in the 
environment and stores that location (usually based upon 
encoder readings) in the robot's computer memory is the 
most common form of robot programming in 
manufacturing environments. During operation, the robot 
is instructed to repetitively return to some predetermined 
sequence of these previously taught locations. Fixtures 
assure that the workpiece is where it is expected and the 
robot carries out its action blindly. Production rates, not 
safety, are of primary concern. Personnel safety is typically 
provided by excluding people from the robot's workspace. 

If a part's location is not correct and an accident were to 
occur, the part or the robot might be damaged, but 
extensive damage is not expected. 

Hazardous environments, however, place a premium on 
accident-free operation due to potential serious 
consequences of damage to the workpiece or environment. 
Rather than assuming that all objects in the robot's 
workspace are where they are expected, as in 
manufacturing environments, objects are always 
anticipated to be in unexpected locations in hazardous 
environments and all robot motions must be continuously 
monitored and validated. 

Much of the original work leading to the development of 
graphical programming technologies at Sandia National 
Laboratories was performed to enable the application of 
robotic systems to hazardous nuclear and toxic 
environments. Due to the desire to limit personnel access, 
it is desirable to program robot systems within the context 
of a model of the environment, rather than use the 
teaching approach which would require operator entiy into 
the hazardous environment. Without operator entry for 
close-up observation, teaching-based manual 
programming is very difficult. Thus, it is desirable to 
reduce the need for detailed operator programming and to 
transfer much of the reasoning task to the computer system 
(including sensors) controlling the robot. The knowledge 
base which the computing system uses to perform decision 
making is termed the World Model 


To allow the robot's control computer to intelligently 
reason about the environment, environmental sensing 
must be provided. Environmental sensing allows dynamic 
updating and validation of the World Model so the 
computer system's reasoning process is based upon a valid 
model. Real-time sensing increases operational safety by 
allowing the computing system to adapt the robot's 
movements to compensate for uncertainties in the World 
Model Under no circumstances may the robot be allowed 
to increase the hazard associated with the already 
hazardous environment. If this occurred, robots would not 
be used because safety is an overriding issue when dealing 
with hazardous environments. 

4, DEFINITION OF INTELLIGENT ROBOTS 

Since the most common commercial robot applications are 
for repetitive operations requiring no sensing or decision 
making, robots are frequently envisioned as devices 
capable of only this type of behavior. Based upon the 
discussion above, many hazardous environments clearly 
require intelligent robot systems with more advanced 
operational characteristics. Thus, it is worthwhile to define 
what the term intelligent robot system means in the 
context of this paper. 

Intelligent systems are systems which can make 
appropriate decisions when presented with situations. 

Such decisions include selecting motions which 
accomplish a goal without collisions with objects in the 
environment. John Hopcroft viewed robotics as the study 
of representing and reasoning about physical objects in a 
computer. Incorporating the more traditional mechanistic 
view of robotics provides the definition used here: 

"Robotics is the integration of the sciences of sensing, 
representing and reasoning about physical objects in a 
computer coupled with electromechanical systems to carry 
out purposeful actions." * 

In addition, intelligent robots possess skills with well 
characterized capabilities which can be used to accomplish 
tasks. A skill, for example, might include applying 
well-formulated knowledge about forces of interaction to 
perform in-contact operations, such as assembly, without 
operator intervention. 

A reasonable goal for an intelligent robotic system is to 
serve as a supervised electromechanical system possessing 
sufficient intellect to serve in the place of a similarly 
supervised human. In the event a robot manipulator cannot 
accomplish its task due to errors, it either re-plans or 
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requests help from the supervisor, much as the replaced 
human would do. While intelligent robotic systems 
possessing the full range of capabilities implied by this 
goal may be beyond the current state of the technology, 
significant strides are being made. 

5. A CONTROL ARCHITECTURE FOR 
INTELLIGENT SYSTEMS 

In order to satisfy the performance characteristics 
described above, a structured computing system has been 
developed which allows incorporation of the wide range of 
capabilities required for the successfiil operation of robot 
systems in hazardous environments. The required 
capabilities range from fast, servo-level response based on 
sensor inputs (e.g., follow a surface contour based upon 
interaction forces) to slower responses in which evaluation 
of various alternatives is involved (e.g., planning a 
trajectory for the manipulator end point). The computing 
environment must integrate all levels of control into an 
efficiently executed control strategy which smoothly 
transitions from one control mode to the next. The basic 
computer architecture used for intelligent robot control is 
shown in Fig. 1. The hierarchical control environment has 
two main systems; the reasoning system in which high 
level control processes are performed, and the real-time 
control system in which fast response control processes are 
carried out. 

Within the reasoning system, the computer constructs an 
approximate World Model based upon knowledge about 
the environment (e.g, a map), robot characteristics (e.g., 
kinematics) and heuristics about objects in the 
environment and their behavior (e.g., physical limitations 
of the robot). The reasoning system also displays various 
aspects of the World Model (e.g., geometric models, x-y-z 
plots, parameter traces) for operator understanding and 
assistance in decision making. As indicated above, this 
World Model is modified by sensory information and 
provides the foundation for automatically generating plans 
(e.g., a collision-free robot trajectory) which are translated 
into robot manipulator motion primitives. The control 
processes taking place within the reasoning system can be 
quite complex and may require considerable computing 
time. 

Within the real-time control system, servo control of the 
robot is accomplished. Responsiveness of the control 
system is extremely important as it is responsible for 
executing the robot motions developed by the reasoning 
system or supplied by direct intervention by the operator. 
A slow real-time control system would introduce delays 
between the commanding of movement and the execution 
of that movement by the robot, leading to instabilities in 


the system. Sensors and the World Model are employed to 
monitor the execution of these motions and to 
automatically perturb the robot motions if necessary to 
provide safe operation while accomplishing the desired 
task. Situations requiring direct operator control vary from 
the teaching of robot locations to the recovery from errors 
with which the robot control system is unable to cope. 
Experience with master/slave manipulators suggests that 
even highly trained operators experience considerable 
difficulty and tedium when executing remote 
manipulations.^ Tedium can result in unsafe operation. 
Thus, direct operator commands are also monitored by the 
intelligent control system for compliance with safe 
operating practices and accepted procedures. 

Within the context of Fig. 2, the operator takes on the role 
of planner and develops task plans for the robot 
manipulator to execute. However, the computing and 
sensory systems maintain their role as developers of 
approximate World Models and real-time controller. Much 
in the manner that the real-time control system perturbs 
the robot commands of the supervisor, the real-time 
control system now perturbs the commands of the operator 
within the context of a World Model and sensory system. 
The real-time control system assists the operator in 
automatically avoiding obstacles and executing controlled 
interactions with the environment while the operator 
performs the high-level task and path planning. Such 
computer assisted approaches to manual operation have 
proven effective in providing responsive robot manipulator 
systems capable of safe flexible operation when manually 
operated. ^ 

5.1 Geometric Modeling 

The intelligence of an intelligent robotic system resides in 
the World Model and algorithms for accessing and using 
the knowledge contained therein. The representation of the 
World Model is critical and the subject of much study 
because the efficiency of information retrieval determines 
the usefulness of the knowledge. ^ Robots deal with 
physical objects and, as stated by Hopcroft, reasoning 
about physical objects is the key element to constructing 
intelligent robots. A robot's ability to interact with a 
physical object is, to a large extent, defined by geometries. 
Effective geometric reasoning is the key element in 
constructing practical World Models, 

5.2 The Role of Visualization Models 

The World Model as described here encompasses much 
more than a model of geometric objects. The World Model 
is the visualization conduit providing the operator with 
insights into the performance of the control system and 
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displaying the results of control decisions. In addition to 
the geometry of an object, the model includes the 
kinematic, dynamic and motion characteristics of all 
movable objects in the environment. The World Model 
contains the knowledge to understand and predict how a 
controllable device will move and provides insights into 
how to plan a trajectory or path and in some cases may 
include knowledge on how to carry out tasks or complex 
operations. The World Model functions as the operator’s 
window to the control of the intelligent system. It acts to 
validate command operations, provide real-time feedback 
to the motion or changes in the system, and provides an 
ideal place for quality assessment and data logging of all 
operator commands and the results of those conunands. 
The visualization of the World Model is the key to 
operator understanding of system operation. 

5.3 Simulation vs> Control 

0\^er the past few years, much work has centered on the 
simulation of robots. In many cases this was the first step 
in off-line programming of these devices.^*’^ For static 
environments this may be all that is necessary. However, 
many of the control tasks of today are not static situations. 
These environments are typified by the challenges of 
remediation of hazardous waste or the manufacturing 
environment of flexible manufacturing systems. In these 
operations the work environment is not always well 
characterized before work begins, and may in fact change 
as the system operations are executed. The dynamic nature 
of these environments require a close coupling of the 
simulation and control systems. As updated information 
about the work environment is made available, the 
simulation or planning system must be capable of 
responding to the updated information and perturbing the 
commanded operations in response to these changes. The 
updated system information may be made available from 
sensors or from changing production requirements for 
flexible manufacturing systems. 

In addition, the close coupling of the simulation and 
control system allows for safe integration of operator 
commands into the control of the robotic systems. The 
visualization system of the World Model acts as a filter to 
the operator commands. The World Model tests and in 
some cases perturbs the command before it is 
implemented. This is critical for the safe operation of 
systems requiring operator intervention. 

5.4 Graphical Pro 2 ramming and Control 

The goals of the development of the graphical 
programming and control system are faster and safer 
system operation and enhanced operator programming. 


The very nature of a 3-D representation of a modeled 
system enhances user operation, user understanding and 
aids user visualization. Graphic representations make it 
easy for the operator to program the system with icon 
selection and interactive manipulation of the graphic 
images. 3-D representation also enables whole-body 
collision detection not available anywhere else. Through 
simulation of camera views a camera is virtually presented 
to the operator. These views serve as an enhancement to 
live camera views, which may be limited and incomplete, 
and can enhance operator understanding of the work 
environment. 

Control decisions which are deemed safe within the 
context of the World Model are conununicated directly to 
the intelligent devices. Any translation is done on line and 
as part of the communication process between the 
reasoning system and the real-time control system. The 
motions caused as a result of these commands are fed 
directly back into the visualization system for recording 
and operator understanding and as a check on the validity 
of the commanded motions. In addition, sensors present in 
the intelligent devices locally verify modeled geometry and 
communicate discrepancies to the World Model The 
World Model is dynamically updated and geometries 
constructed or modified as necessary. The updated 
information is then available for all future control 
decisions. 

6, Implementation and System Description 

A commercially available graphical robot simulation 
environment IGRIP from Deneb Robotics, Inc., was 
combined with the Sandia developed Generic Intelligent 
System Controller (GISC).’ ® IGRIP was operated on 
Silicon Graphics workstations. Thus, we have built upon 
rapidly improving graphical computers and the large base 
of computer graphics capabilities. As the state of the art in 
high speed graphics computers and computer graphics 
improves, these advanced capabilities can be directly 
integrated. A critical area of development in the graphical 
programming of intelligent robots is providing high speed 
and efficient communication between the graphics World 
Model and the intelligent sub-system control elements. 
This is essential to allow graphics models to send 
commands to other processes and to allow other processes 
to send commands, interrupts and positional information 
to the graphics World Model for model updating and 
display. 

The Low Level Telerobotic Interface (LLTI) developed 
jointly with Deneb Robotics to allow communication at the 
binary level between processes was the first attempt to 
tightly couple external data to the animation display. This 
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high-speed interface is essential to operator understanding 
of changes in the actual robot environment. It is also 
essential for proper visual feedback of operator-directed 
changes in the graphic environment using external input 
devices. 

CISC allows the operator to control both the simulation 
system and multiple disparate programmable devices from 
the same control environment and in the same 
programming language. This is enabled by the use of the 
Sandia developed Robot Independent Programming 
Environment (RIPE) and in the Robot Independent 
Programming Language (RIPL).^ The use of RIPL allows 
the system designer to communicate to many different 
programmable devices in the same manner. At one level 
the use of RIPL allows the programmer to communicate 
and direct the robot from different manufacturers in the 
same language syntax and structure. At another level 
through combination of the World Model and RIPL 
language, the operator can program any robot by simply 
graphically causing the robot model end point to move in 
response to external inputs. The benefits of the graphics 
environment is that the operator immediately sees the 
results of his directions and the operator is warned of 
potential collisions between the robots and objects in the 
environment. In fact, motion plans which would 
potentially cause collision are not permitted to be 
communicated to the controller and executed. 

GISC employs a distributed computing environment for 
supervisory and real-time control of the robotic system, as 
shown in Fig. 1. The computing environment consists of a 
Silicon Graphics Iris (SGI) workstation which runs IGRIP 
for World Model display and animation, and also runs 
associated data communication processes. A Sun 
computer was also interfaced to the control environment 
for display of various non-geometric aspects of the World 
Model Interfaced to the SGI are multiple Motorola 68030 
single-board microprocessors resident in VME backplanes, 
which provide real-time, sensor-based control. The RIPE 
interface software translates the high-level commands 
from the SGI into sequences of low-level commands 
understood by the controllers and sensors. The robot 
controller and sensors are interfaced to the computing 
environment by the Sandia developed Intelligent Robot 
Operating Environment (IROE).^° IROE is a real-time, 
multitasking operating environment built upon the 
VxWorks operating system (Wind River Systems, Inc.), 
and was specifically developed for the communication 
demands of real-time sensor directed-control of intelligent 
robot systems. 

The graphics environment is an ideal environment in 
which to off-line program the robotic system. To minimize 


system downtime or to limit operator programming time 
on the system it is desirable to pre-plan the intended 
motions of the robotics system as much as possible. 
Accurate 3-D geometric models of the robot, its end 
effectors, and the fixtures and workpieces in the 
environment allows the robot programmer to carefixlly 
plan robot trajectories. Since the robot motions are 
simulated using accurate kinematic models, the 
pre-planned motions faithfiilly reproduce the actual 
motions of the robot. In addition, task execution time can 
be determined and analyzed for system bottlenecks. 

Each robot typically has its own high-level programming 
language. The code to program the robot is written much 
like any other code, with the exception that the positions 
the robot is to move to are manually taught to the system. 
The integration of the graphical interface with an input 
device allows programming of the motions of the robotic 
system without having to write code. This can greatly 
speed the programming process and decrease the 
programming skill level required of the operator. 
Programming is thus realized by the operator causing 
motions of the simulated robot, the computer system 
remembering the commanded motions, replaying and 
displaying the conunanded motions to the operator, and 
then with operator acquiescence, downloading the 
programmed motions to the robot controller and causing 
the motions to be executed. 

The system software is written in such a way that any 
input device (e.g., teach pendant, spaceball, 
force-reflecting master) can be used. The control 
commands from the input device are sent directly to the 
graphical system. The control system determines how 
these commands are to be interpreted, {e.g . , the movement 
of individual joints, or end point control based on a tool 
frame). The result of these commands are displayed 
real-time in the graphics environment. The motion 
information is combined with the collision detection 
capability contained in the World Model Thus the 
operator can be alerted to the close approach of the robot 
to known objects in the environment. 

The very nature of the graphics model allows for the 
ability to detect collisions between modeled objects. This 
ability is very important for the safe programming of 
robotics systems. It is undesirable for a robot to have an 
uncontrolled collision with objects in its environment. 
Such interactions are typically conducted at slow speeds, 
with force sensors measuring and controlling the forces of 
interaction. While an operator's attention is generally 
focused on the end of the robot and on the task at hand, 
another part of the robot, such as the elbow, may collide 
with objects in the environment. The collision detection 
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capability of the graphics environment detects and warns 
of all such impending collisions. In addition, both the 
parts selected for collision detection and the warning 
distance are user- selectable. This ability allows the task 
planner to select the objects of importance and to 
dynamically alter the warning levels based on the task at 
hand. This also allows for different collision detection 
capabilities depending on the skill of the operator. 

6.1 System Operations 

We have presently employed this system in two 
laboratory-scale systems and one full-scale demonstration 
for critical feature testing. In each case, the graphical 
control environment is used as the programming and 
control interface for CISC to produce a faster, safer, and 
more economical system. The first application of this 
technology was a laboratory demonstration relating to the 
remediation of nuclear waste buried in underground 
storage tanks. The second application of this technology 
was the safe tele-operated inspection of a nuclear waste 
transportation cask.^^ The full- scale demonstration 
involved three robots working together to map a mock 
underground storage tank and then remove the simulated 
hazardous material.*^ 

The graphical control environment described has been 
implemented and demonstrated in a Sandia laboratory test 
facility designed to demonstrate the robotic 
characterization and remediation of hazardous waste 
stored in underground tanks. The foundation World Model 
contains 3-D geometric models of the system generated 
from construction drawings. The system and its graphical 
representation can be seen in Fig. 3. The test work 
environment consisted of a gantry robot (Cimcorp 
XR6100) and a 2.4 by 2, Im rectangular tank filled to a 
depth of 0.6m with moist sand to represent the waste. The 
test tank contains both buried objects and pipes, as well as 
structures protruding above the surface of the waste to 
represent the types of obstacles that would be encountered 
during characterization and clean-up of actual storage 
tanks. The waste surface and some obstacles were 
deliberately not modeled to test the control system's ability 
to detect and map unknown obstacles. The World Model 
also contained models of the robot's kinematics and 
motion limitations and heuristics defining tasks such as 
mapping and waste removal. The initial robot motions 
were defined by operator, tested for safety and potential 
collisions within the World Model and executed. As 
motion occurred the robot model was driven by 
information received from the robot's encoders. As the 
sensor systems detected new information about the 
environment, such as waste surface profiles the data was 


communicated to the World Model and graphically 
displayed. 

The second application of the graphics control 
environment is the inspection of nuclear waste shipping 
containers. This system also made use of the Cimcorp 
gantry robot in conjunction with a quarter-scale mockup of 
a waste transportation storage cask. Through use of the 
graphical interface, the operator can direct inspection 
tasks around the container. The operator can pick up tools 
and direct inspection tasks with the assurance that 
collisions will be prevented by the use of the graphics 
control environment. This capability will be essential for 
safety inspections and during emergency operations. The 
geometric World Model and the graphics interface to the 
operator are used to graphically program the robot system 
and to verify safe operation. For example, prior to the start 
of a series of programmed operations, the computer 
compares the desired robot trajectories with the geometric 
knowledge contained in the World Model. Any unsafe 
trajectory (e.g., a collision between the robot and a model 
feature) is detected and reported to the operator via the 
graphics interface prior to robot motion. The operator can 
then modify the proposed robot trajectory or program the 
robot by simply manipulating the robot image in the 
graphic interface. These modifications to the robot 
trajectories are then verified for safe operation by the 
World Model. The computer system automatically 
reprograms the robot's movements to include the operator's 
commands. Only collision-free robot trajectories are 
transmitted to the robot. 

The full-scale demonstration encompassed and enhanced 
the capabilities developed in the laboratory 
demonstrations. The expanded capabilities included 
interaction and control of three different robot systems, 
increasing the system's ability to handle a variety of work 
environments, and a number of different types of tools and 
sensor types. The system consisted of: a RedZone RTI 
robot used for tank wall and weld inspection; a Schilling 
Titan 7F robot used for fine manipulation tasks mounted 
on the end of a 28-foot-long Spar hydraulic arm used for 
gross positioning. A drawing of the system can be seen in 
Fig. 4. Also the graphical representation can be seen in 
Fig. 5. The tooling included an ultrasonic mapping tool, a 
hydraulic cutter, and a small pneumatic chipper. 

The increased number of robots and tools tested the 
system's ability to program and communicate to different 
systems in the same language and using the same 
programming tools. The control structure and the 
command flow from the reasoning system to the real-time 
controller allowed seamless communication and control to 
each device. It also demonstrated the ability to perturb 
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commanded motion from updated real-time information. 

As in the laboratory demonstrations, surface information 
was mapped by the sensors and included in the World 
Model Based upon this mapped information, ideal task 
trajectories are generated. When these trajectories are 
executed the sensor system perturbs the motion to 
maintain a desired height above the surface, for example. 
The results of the perturbed motions are fed back into the 
graphics system informing the operator of the changes 
made to the original desired trajectory. 

In addition, the use of virtual camera views for operator 
feedback was implemented. The operator*s view of the 
graphics representation of the World Model was 
adjustable. Through use of a set of dials the operator could 
modify the scale, translation and rotation of the view or 
views as he/she desired. This ability allowed for more 
detailed inspection of those areas of the model which were 
of particular interest. It should be noted that foil collision 
detection of the entire model envirorunent was always 
available regardless of what was displayed on the screen. 

In the particular case of removing and replacing tools from 
the tool rack, the virtual camera view was clearer and 
more valuable than the actual camera view which was 
cluttered and often obscured the desired information. 

1. RESULTS & CONCLUSIONS 

The use of a graphical control environment for robotic 
systems can accelerate tasks such as the removal of waste 
stored in underground storage tanks. Advanced 3-D 
geometric modeling concepts can allow robot motion 
planning and thus, facilitate programming the system 
without manual code generation. This greatly reduces the 
requirements for detailed, step-by-step programming by 
skilled robot programmers. The geometric model can 
interpret operator manipulations of the graphical 
representation to automatically program the robot to 
respond in the manner desired by the operator. In addition, 
the World Model can also be used to validate operator 
commands to the robot system to ensure safe operation 
during manual control. Graphical display of the results of 
the operator's robot commands can provide operators with 
perspectives not available from direct video viewing 
commonly used in the tele-operation of remote systems. 
This increases system safety by warning of impending 
collisions, and is especially important for impending 
collisions away from the robot end effector where the 
operator's attention is frequently focused. If an operator's 
command could result in a collision, the control system, 
with reference to the World Model, can prevent execution 
of the command by the robot and communicate the source 
of the problem to the operator through the graphics 
interface. 
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Fig. 3 Graphical Representation of Waste Remediation Test Bed 
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Fig . 5 Graphical Representation of Full-Scale Waste Tank Remediation System 
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Abstract 

This paper describes the scenario-based, object- 
oriented approach used to specify the software 
architecture of the next generation of robotic 
controllers. We also discuss how we intend to 
implement a version of the controller via a 
multi-agent approach. We also describe our 
real-time, fault-tolerant, cooperative reasoning 
tools that we intend to use to facilitate 
developing the implementation of the controller. 
We also describe how we intend to interface 
existing applications and controller components 
to the tools so that they interact via objects 
detailed in the controller specification. 

Introduction 

Problem 

The manufacturing industry of the United States 
has become increasingly less effective as other 
nations pour money into research. To regain 
world technological pre-eminance, the 
Advanced Research Project Agency and the 
National Center for Manufacturing Sciences 
have launched a number of programs. One of 
these, the Next Generation Controller program, 
attempts to develop a standard for robotic 
controllers for the post- 1995 time frame. This 
paper will discuss the approach taken during our 
working for the program while staying within 
ethical boundaries concerning this crucial 
research program. The interested, authorized 
reader can obtain the actual specification 
document can be obtained from the National 
Center for Manufacturing Sciences.* 

Most of the robotic controllers used in 
America today reflect programming concepts 
that are decades old. Change is required to 
remain competitive. However, manufacturers 
will not invest in equipment unless the return on 
investment promises to far exceed any risks. 
Therefore, the characteristics of any controller 
proposed must satisfy two different general 
goals: risk reduction and performance. 

Robotic controllers must ensure several 
key items to facilitate risk reduction. First, 
current manufacturing practice must be 
supported; manufacturers are loath to shut down 


a working factory on a promise of efficiency. 
Second, current equipment must be supported. 
Finally, numerous existing applications and 
utilities must be supported. While each of these 
items will be discussed later in this paper, open 
systems largely address these concerns. 

Innovative robotic controllers must also 
provide vast performance improvements for 
acceptance, where "performance” encompasses 
several aspects. Obviously, controllers almost 
always have real-time deadlines to meet and so 
should afford accurate results via efficient 
computational processes. Controllers should 
also afford previously unavailable capabilities, 
better user-interfaces, and promises of more 
efficient programming. Since product 
acceptance eventually depends upon economics, 
either flexibility (expanded product-line or 
higher quality product) or lower life cycle cost 
(through lower product development costs) must 
be offered. 

Loss of market share and un- 
transferred technical innovations prompted the 
programs adoption of both risk reduction and 
increased performance. Current practice in the 
robotic control industry has hardly changed in 
decades (with the exception of a few, very well 
financed areas). The programs goals attempted 
to afford advanced features (such as art-to-part 
manufacturing) while facilitating a smooth 
transition from existing equipment and support 
software to the next generation of machinery. 

Program History 

This paper will try to explain the approach that 
was eventually, successfully used in developing 
a specification for the controller: scenario-based 
object-oriented analysis. This approach will be 
the central focus of the paper because traditional 
approaches failed dismally in attempting to gain 
enough support for developing a standard. 
Another key aspect of developing the standard 
was "cleaning the kitchen"; if too many cooks 
don’t make good broth, then using fewer cooks 
might help. It did. 
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Organization of Paper 

We first describe the scope of the controller's 
specification to provide context for the rest of 
the paper. We then describe the scenario-based 
methodology and how it can be employed to 
specify a controller. We then provide a 
description of the contents of such a 
specification. We finally analyze the 
effectiveness of our approach and draw some 
conclusions concerning the utility of the 
specification and the effectiveness of the 
proposed controller. 


Scope of Controller Specification 
Developing a national standard for robotic 
controllers necessitates employing a wide brush; 
painting a picture of robotic use and 

manufacturing needs in the near fiiture requires 
comprehensive coverage of the domain while 
affording a great deal of flexibility. 

Comprehensive coverage is required to examine 
manufacturing from the level of controlling 
motors in actuators to analyzing throughput of 
factory lines. We try to specify interfaces to 
almost any type of information that a user of the 
system might want to analyze or modify. 

Comprehensive coverage is also 
reflected in the very-varied concerns examined 
by the specification. Physical elements, such as 
sensors, effectors, payloads, conveyors, tools, 
and users are considered. Abstract physical 
elements, such as envelopes, schedules and 
enterprise expectations should also be 
considered. Abstract elements, such as those for 
configuration, are by far the most difficult, 
essential elements to include; different 
implementations employ different strategies for 
configuration, different kinds of elements 
requiring configuration, and different 
granularities of configuration. Given that the 
controller should address needs as diverse as 
composite baking, precision material removal, 
and many-axis (> 100) assembly, facilitating 
comprehensive coverage often required multiple, 
combinable representations. 

Comprehensive coverage also includes 
covering current practice: in terms of specifying 
interfaces to existing machinery, programming 
techniques, and support software. Hence, 
interfaces to facilitate interacting with programs 
for solenoids and C++ programs were both 
supported. 


The controller specification also 
reflected a great deal of flexibility. 
Interoperability and plug-replaceability are 
essential in new robotic architectures destined 
for commercialization. Such flexibility is 
facilitated by employing existing standards and 
an open, published architecture. With respect to 
this endeavor, the obvious standards to employ 
were EXPRESS and the Product Data Exchange 
Specification.^’^ 

The Product Data Exchange 
Specification (PDES) attempts to provide a 
standard mechanism for describing virtually any 
object that might be manufactured as a product. 
Hence, the numerous volumes of the 
specification are appropriate for addressing a 
variety of fields. The standard is hierarchical, 
building upon very primitive concepts such as 
Cartesian coordinates and measurement units to 
eventually describe features such as pockets and 
items such as resistors, 

PDES was written in the specification 
language EXPRESS, EXPRESS is an ISO 
standard (but currently undergoing revision for 
its next version), EXPRESS is well-suited for 
describing object-oriented concepts (see Object- 
Oriented Methodology), since it supports both 
inheritance and abstract data t>pes. 

Inheritance is the notion that items 
described at a higher level in a hierarchy are 
subsumed in the structure of objects placed 
lower in a hierarchy (e.g., all mammals have 
mammary glands). Abstract data types allow 
new representations for new items (e.g., a car 
can be represented as a data type and used in 
representing a fleet of cars). 

An example of EXPRESS code is 
provided in Figure 1. The first entity (object 
described) is a representation for a sine-wave. 
Such a type of line should inherit the 
characteristics of general Line's and is more 
abstract than lines described as 
Cartesian_sine_line and Spherical_sinejine. 

A sinusoidal line has several attributes, 
to describe the phase, amplitude and 
compression factor of the wave. EXPRESS also 
facilitates constraining the attributes of an 
object. This example uses the functions 
all_in_0_to_2pi and periodic_j)henomenon to 
describe relationships among values of attributes 
that must be present in valid objects. 

The specification of the sine-wave is 
used in the second entity to describe a series of 
sinusoidal lines (such as might be generated by a 


Fourier transform). The example allows for any 
positive number of sine curves to be combined to 
describe a Sine_series_line. The constraint 
expressed here states that there must be an offset 
for each curve specified. 

Much of the work in the United States 
with respect to both EXPRESS and PDES stems 
from the National Institute for Standards and 


Technologies (NIST). The ISO acceptance of 
EXPRESS as a standard has prompted a large 
number of tool vendors to also support 
EXPRESS. Similarly, a great deal of funded 
research employs PDES to encourage its 
acceptance. NIST also has developed a public 
domain toolkit for manipulating EXPRESS 
models to facilitate working with PDES. 


ENTITY Sine line 
SUBTYPE OF (Line) 

SUPERTYPE OF (ONEOF(Cartesian_sine_Iine, Spherical_sine_line)) ; 
(* 

An individual line described in terms of 
Ampliture*sin(PeriodCompression*Angle+Phase) 

*) 

Compression_coefTicient : REAL; 

Phase : REAL; 

Ampliture : REAL; 

WHERE 

all_in_0_to_2pi(Profile_element.Phase); 

periodicjjhenomenonfSineJine); 

END_ENTITY; 


ENTITY Sine_series_line; 

(* 

Represents a function as a summation of sines of the form: 

f(w) = Cl*sin(Alw+Bl) + C2*sin(A2w+B2) + C3*sin(A3w+B3) + .... 

Offsets (to, tl, ...) are also provided so that relative calculations can be performed, as in 
f(w) = Cl*sin(Al(w-tO)+Bl) + C2*sin(A2(w-tO)+B2) ... 

*) 

Offset: LIST[1:#1 OF REAL; 

Component: LIST|1:#] OF REAL; 

WHERE 

SlZEOF(Offset) = SIZEOF(Component); 

END_ENTITY; 

Figure 1 : Sample EXPRESS Model 


Object-Oriented Methodology 
Our goal in developing the robotic controller 
specification was to provide a set of standard 
interfaces by which various applications and 
equipment could communicate. We provided 
this interface by describing the set of objects that 
could be transmitted among applications and 
support software. We also specified the 
constraints placed upon these objects to ensure 
their validity. We also described some minimal 
performance requirements required of various 
classes of applications when generating and 
manipulating these standard objects. 

The object-oriented methodology 
employed and developed novel concepts in 


Object-Oriented Analysis (OOA) and Design 
(OOD). Because public domain tools support 
automated translation of EXPRESS code into 
C++ (arguably the most popular language 
considered to be "object oriented"), Object- 
Oriented Programming (OOP) is also supported. 

(X)A strives to discern the essential 
objects for describing a domain. OOD attempts 
to organize and describe these objects, their 
behavior, and their interactions. We employed a 
specific object-oriented technique. Scenario- 
based Object-Oriented Analysis and Design, to 
derive the controller architecture. These 
techniques are particularly good at clearly 
expressing concepts in a domain and at 
providing an audit trail to the source of the 



original object. To improve the clarity and 
utility of the our work, we employed Computer- 
Assisted Software Engineering (CASE) tools to 
express the products of our analysis and design. 

Terminology and Procedures 
Object-Oriented Software Engineering is a 
relatively new approach for developing software. 
The approach treats instances of data types as 
objects. The data types themselves are typically 
called classes. Walt Disney’s favorite car, 
"Herbie”, would be considered an object of the 
class "car". 

Classes have attributes describing 
characteristics of objects. These attributes are 
assigned values to reflect the characteristics of 
particular objects. Hence, the color attribute of 
the class "car" with respect to the object 
"Herbie" might have the value "white". 

Object-oriented techniques provide 
numerous benefits, a description of which would 
be beyond the scope of this paper. However, 
two of these advantages previously mentioned 
(abstract data types and inheritance) facilitate 
developing hierarchies of objects. 

The Composition Hierarchy pictured 
in Figure 2 was described via EXPRESS LIST'S 
in Figure 1; composition hierarchies express 
how multiple objects can be combined to 
describe more complex objects. Machines are 
excellent examples of composition hierarchies: a 
complex machine including tools, spindles, 
links, etc., can be succinctly expressed in a 
composition hierarchy. 



Figure 2: Composition Hierarchy 


The Generalization Hierarchy 
expressed in Figure 3 was described via the 
SUPERTYPE and SUBTYPE relationships as 
part of the EXPRESS code in Figure 1. 
Generalization Hierarchies facilitate less 
abstract classes inheriting attributes from more 
abstract classes. A clearer example might be 
machines: Abbrassive__waterjet would be less 
abstract than Material_removal_machine that 


would be less abstract than Making_machine, 
which would be less abstract than Machine. 
Abbrassive_wateijet*s inherit attributed from 
Material_removal_machine that inherit 
attributes from Making_machines that inherit 
from Machines. 



Figure 3: Generalization Hierarchy 


Scenario-based Techniques 
We employ scenarios to ascertain the various 
objects, their behavior, and their interactions in 
a given domain. Scenarios are timelines of 
events with respect to one or more objects. One 
can imagine a scenario as a movie depicting the 
existence of an object at some level of 
granularity. 

Determining the level of granularity for 
examining an object is essential and often non- 
trivial. A machine can be described by its 
shape, or the shapes of its components, or by the 
shape of pieces of the components (such as 
screws on a jig). 

Granularity is complicated by the fact 
that many aspects of an object might require 
descriptions in different measurement units and 
at multiple levels of abstraction. For example, if 
a planning application is going to reason about 
the behavior of a machine, it must know the 
machines’ capabilities. The planner must know 
what the machine can do and with what 
precision. To ascertain the interactions among 
objects affecting the machine's precision, we 
may have to examine both a more precise 
granularity of composed objects (e.g., 
interactions of surfaces of the jig and tool) and a 
more precise granularity of time (e.g., 
microseconds as opposed to seconds). Hence, 
we use scenarios describing the same objects, 
but encompassing different time granularities. 

Scenario-based analysis is particularly 
effective because it directly maps elements in the 
domain to models in the interface. Scenario- 
based analysis builds customer confidence 



because he has a better concept of the 
mechanisms underlying any "black boxes”. 
Scenario-based analysis also facilitates 
traceability; because classes originate from 
particular scenarios, questions that a customer 
might have can quickly and efficiently be 
addressed. We tend to involve the customers as 
much as possible in the analysis phase, 
hopefully obtaining the actual scenarios from 
them via interviewing techniques. 

Implementing a Controller 
We hope to implement a version of the 
specification using a multi-agent approach. 
Recent literature in artificial intelligence 
suggests that collections of simple agents are 
much easier to control than large, monolithic 
programs. 

We intend to treat applications and 
portions of the implemented controller as 
intelligent agents (Figure 4). Many of these 
agents, e.g., planners, will be represented as 
knowledge-based applications. Other 

applications, e.g., machine executives, will be 
embedded in wrappers to communicate standard 
objects from the specification. 
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Figure 4: Controller Components as Agents 


interfaces. DAIT supports forward-chaining 
reasoning, procedural programming, functional 
programming, object-oriented programming, 
and deductive database queries. We expect to 
implement both Fuzzy inferencing and 
backward-chaining in the near future. 

Conclusions 

Object-oriented analysis and design provide an 
attractive mechanism for examining the domain 
of robotic control. Scenario-based software 
engineering techniques can often be used to 
achieve consensus among multiple customers 
concerning requirements. The Next Generation 
Controller program successfully used these 
techniques in developing a specification for 
standard robotic controllers in the U.S. 

We hope to soon implement a version 
of the controller by employing multi-agent 
techniques. We will use internally developed 
tools specifically designed for cooperative 
knowledge-based processing in this endeavor. 
By embedding existing software in wrappers 
communicating objects found in the 
specification, we will facilitate interoperability 
and interchangeability of various components of 
our controller. 

The next generation of robotic 
controllers must be innovative enough to support 
avant-garde research concepts such as art-to- 
part manufacturing. However, this specification 
of these new controllers will also have to address 
the needs of supporting current hardware and 
software. We feel that the specification 
adequately addresses these concerns. We also 
hope to soon realize a controller demonstrating 
interoperability and interchangeability of 
components while offering a very high degree of 
functionality. 


Crucial to our implementation of a 
controller will be the Distributed Artificial 
Intelligence Toolkit (DAIT).^’^ DAIT provides 
distributed knowledge-based processing while 
affording transparent processor fault-tolerance. 
DAIT also includes predicates to facilitate real- 
time control. DAIT is based-upon NASA's C 
Language Integrated Production System.* DAIT 
includes tools for metering, configuration, 
interprocess communication, and user- 
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AN END USER'S WISHLIST 
BY CHARLES W. WARD 

It was 1983. I was standing in a Sears store at 5:00 in the 
morning knowing there had to be a better way. We had no one to 
clean the 180,000 sq/ft store and the doors were to open at 9:00 
a.m. sharp. Considering the average cleaner works at a rate of 
3,750 sq/ft per hour I knew I had a problem. There was no way to 
delay the opening by asking management to hold off their customers 
until my crew showed up. The show had to go on! So I rounded up 
who I could on the phone and we limped through the morning, just 
barely finishing as the crowds poured through the doors. This was 
not just a bad day. It was a normal day. That was the year I 
began my search for a mobile robot that could clean. 

This picture of our cleaning operation is not unique. It 
happens everywhere, in every company, every day and in every city 
in this country. Absenteeism is a routine daily occurrence for 
those of us in the cleaning industry. But fortunately, there are 
those employees who do come to work every day. Employees we know 
we can depend on. But, these are the employees everyone wants. 
They are in demand. Therefore, they tend to leave us for more 
challenges, lucrative offers and better careers with more 
advancement and educational opportunities. In addition to the 
problem of employee absenteeism we also have to manage massive 
employee turnover. 

I believe I can summarize the general problems in the cleaning 
industry in the following way: how to fill the 800,000 new jobs 
that will be created in our industry by the year 2000; how to 
reduce turnover from an industry average of 400% a year, how to 
reduce absenteeism and increase morale, and how to attract 
competent labor. Those companies that are experience to these 
problems are looking at mobile robots as a potential solution. 

My purpose today is to share with you some of our ideas about 
what applications might be roboticized and what factors are 
important to us as end users in the cleaning industry. 

A close friend and associate in the mobile robotics business 
once said to us, "The ideal place to apply a robot is where a 
human is working like one." Using this advice as a guide, we can 
identify many opportunities for automation. This is not to say 
that cleaners don't have feelings or emotions, but rather the 
majority of their work is routine, mundane and repetitive. 

Repetitive tasks that we have considered automating are floor 
scrubbing, floor sweeping, floor polishing, vacuuming, trash 
removal and bathroom cleaning. Of course, other tasks like dusting 
and waxing floors could eventually be automated as technology 
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improves. But our opinion is that when you decide what task you 
will automate you should consider the simplicity of the task and 
engineering capability as well as return on investment. 

Some companies we have known have made the mistake of assuming 
that once they have isolated some tasks to roboticize, have run 
some numbers, raised money for development and adapted their 
robotics technology to a cleaning machine they would be ready for 
sales. But, unfortunately, it is not that easy! Our opinion is 
that before developing a robot one must keep three major things in 
mind; the environment that the robot will work in, the market the 
robot will be sold to and the workers it will interact with. 

First, I would like to discuss the operating environment. The 
diversity of building design and the changing environment within a 
building make it necessary for a robot to be built that is flexible 
and easily adaptable. In the cleaning environment things change. 
People move offices, walls and desks. They often leave huge 
obstructions in the hallway at night. The reality is that no one 
wants to move that big box in the hallway. And people will not 
want to go and rescue a robot when they have their work to do. 

We have experienced two schools of thought that attempt to 
address the navigation of a complex environment. The first is to 
teach the robot everything about the building and provide it a map 
for navigation. Even though we believe this is still the most 
efficient way to navigate we certainly can't have a robot that 
takes months to program. 

The second navigational strategy is preprogramming or self 
teaching. We have had some experience with these types of robots 
and while they help to address the simplicity issue they fall short 
in the performance area. Again, the reason is that each 
environment is uniquely its own and the decision about how to 
navigate these changing environments are being left up to a 
machine! I know if I still get lost in buildings a robot is likely 
to do the same! After some experience we believe that 
preprogrammed robots major drawbacks are inconsistency and they 
require too much human intervention time to operate. 

The robots we operate must be consistent performers. 
Consistency is critical to establishing a corporate image of 
i^ality. This is especially true in the cleaning industry. The 
industrial robots that weld, spray paint and assemble the quality 
cars we see today are consistent and reliable. Mobile robots that 
clean must be just as dependable and reliable. We cannot build our 
operation or reputation around unreliable people. The same is true 
for a robot. Please bear in mind, we'd rather have a robot clean 
1000 square feet well every day than a robot that sometimes cleaned 
10,000 square feet and sometimes didn't. 

The second consideration to make will be the market to which 
the robot will be targeted. Although not a glamorous business, 
cleaning is a stable, secure and large industry. Currently the 
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industry is 50 billion dollars a year here in the U.S. alone and it 
is expected to grow to 80 billion by the year 2000. But because of 
the enormous opportunity and the low barriers to entry the industry 
is flooded with competition. This competition is diverse in the 
way it looks at its customers, operations and its people. In the 
cleaning industry you have every kind of businessperson. From the 
mom and pop cleaning companies who operate out of the trunk of the 
car to huge multinational companies who do a billion dollars in 
cleaning a year. The mentality is significantly different. It is 
critical to know who will buy your unit and adapt the sales pitch 
accordingly. I believe it will be most difficult to build a robot 
that can be used throughout the entire industry. We believe 
success depends on finding a niche to begin. 

Pricing robots for the target market is also a difficult 
issue. As I said before, each building is unique and its operation 
expenses vary accordingly. Some buildings contract their cleaning 
and some maintain their own staff. Some buildings have all carpet 
and some all tile, but most a combination of both. We believe that 
in order to effectively market a robot individual studies of each 
location should be done. To get to a real return on investment 
number real data about labor rates, production rates and total 
cleaning cost are mandatory because these figures can vary from 
building to building. For example, general office cleaning labor 
rates in central Virginia are at an average of $5.00 per hour. But 
in industrial plants around the same region they are at an average 
of $11.00. 

We have included a real analysis that my company did for a 
customer in Virginia that is considering automation. This customer 
desired a one year or less payback period and wanted to roboticize 
his vacuuming process. We make the assumption that the robot will 
work primarily in the wide open spaces within the building where 
there is little obstruction. In this example, where we have shown 
a 7 month payback on a $22,000 investment for two robots, it is 
important to note that there is no human intervention time 
required. The more human intervention required the longer the 
return on investment will be. (See Exhibit A) . 

In addition to adaptability, consistent performance and target 
market the last consideration and the most important factor is 
people. Who will the robot interact with? Who will run it? Who 
will service it? And who will reeducate an industry that has not 
had any major innovation since the vacuum cleaner? It is the 
biggest mistake of all to assume once you have produced an 
efficient and reliable machine that the industry will be beating 
down your door to buy one. There will be an enormous educational 
problem for our industry once technology has begun to be 
integrated. 

Producers of mobile robots must be prepared for that 
educational challenge. They must be prepared to participate in 
reshaping, rethinking and supporting an industry that is inevitably 
facing major changes in the next century. Designers and developers 
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number one barrier to market acceptance is this human factor. It 
is critical to develop robots that serve people, not ones that 
people need to serve. Those robots thought of as "user friendly”, 
however vague the term, will be accepted quicker and stand the 
greatest chance of success. By "friendly” I mean things like; ease 
of programming, self diagnostics, manuals written in simple 
english, ”service"-minded technicians, the absence of hidden costs 
and reliability. 

I have listed some detailed operational specifications I think 
are important to consider when developing a robot for the cleaning 
industry. (Please see "The WishList” Exhibit B) . But ultimately 
the success of a mobile robot in cleaning will be dependent on the 
same thing as it is for any product; how the company who sold the 
robots supported and serviced the people who are using it. 


665 


EXHIBIT A "CASE STUDY" 


KNOWN: 

750.000 Cleanable square feet 
(All areas) 

375.000 square feet carpeted 


125,000 square feet open and 
unobstructed area ideal for 
robot application 

Cleaning done between 6:00 p.m. 
and 10:00 p.m. Monday through 
Friday or 260 days per year 

Cleaning done using part time 
labor 

Human can vacuum 6000 square 
feet per hour 

Wage per hour is $6.00 
including taxes and insurance 


ASSUMPTIONS: 

Robot per unit cost: 

$ 11,000 

Robot annual maintenance: 
$500 per unit 


Robot production rate: 
9000 square feet per hour 

Robot to clean 365 days 
per year 


Robot to work 6:00 p.m. 
to 1:30 a.m. daily 

Expected life of robot: 
8 years 


ANALYSIS 




HUMAN 

ROBOT 

Daily hours 

needed to vacuum 

21 

14 

Days worked 

each year 

260 

365 

Annual hours 

needed to vacuum 

5,417 

5,069 

Annual labor 

cost 

$32,500 



Total robot investment (2 units) $22,000 
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PAYBACK PERIOD 


$22,000 (IMVESTMEMT) 

Annual labor - Annual maintenance 
$32,500 - $1,000 
* .7 Years (8 months) 

Annual savings = $31,500 
Over the 8 year life of the robot: $230, 0004- 

RETURN ON INVESTMENT 
Annual labor - Annual maintenance 
$32,500 - $1,000 

$22,000 Investment 
= 143% Return on Investment 
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EXHIBIT B: "THE WISHLIST" 


1 Robot should be easy to program 

2 Robot should be self charging 

3 Robot needs to be truly "Autonomous" 

or able to run without any human intervention 

4 Robot needs to be able to navigate multiple hallways 

5 Run time should be a minimum of 4 hours 

6 Cleaning path should be as wide as possible but robot should 

be able to go through standard doorways 

7 Robot should be able to call elevators and move from floor to 
floor without human assistance 

8 Routine maintenance and servicing should be similar to non 
robotic equipment. Unit should be able to perform self 
diagnostics 

9 Robot should run without any external navigational aids 
like bar codes, reflective tape or other devices that will 
modify a customer's building. 
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THE FIRST COMMERCIAL FLOOR CARE COMPANY THAT VENTURED INTO THE 

PRODUCTION OF ROBOTICS 


Allen J. Bancroft 

Robotics Project Engineer/Manager 
The Kent Company 
Elkhart, Indiana 


Abstract 

This paper will cover the history of 
innovation at the Kent Company, its 
acquisition by AB Electrolux of Sweden, 
Electrolux's vision that robotics would 
control the future, and the evolution of 
robotic floor care equipment. The paper will 
cover early prototypes, the learning curves 
associated with robotic floor care equipment 
development, bloopers, and the introduction 
of the RoboKenr^ Scrubbervac to the 
cleaning industry. 

Kent - An Historical Perspective 

The central system vacuum patented 
by Gorden Kent eighty years ago launched 
the company which still bears his name. 
Kent, with his father and two brothers, 
began manufacturing in Rome, New York, in 
1 91 3. Innovations in floor care including 
portable vacuum cleaners and floor 
machines established the company as a 
viable force in the industry. 

The Kent Company merged with Finnell 
Systems in 1962. Finnell was founded in 
Missouri in 1916 and relocated to Elkhart, 
Indiana, in 1928. Kent continued to 
manufacture in both New York and Elkhart 
until after AB Electrolux acquired the 
company in 1969. Operations were 
centralized in Elkhart in 1971 and Kent is 
now one of the oldest full line manufacturers 
of floor care equipment in the United States. 

Product innovation has always been a 
Kent Company hallmark. Kent patented the 
offset design floor machine in 1 929 and 
introduced the revolutionary MICROSTAT® 
vacuum which filters virtually 100% of 
bacteria-laden particles from the air for use 
in hospitals in 1 959. With its acquisition of 
Kent, Electrolux reemphasized innovation in 
an industry still often associated with mops 
and buckets. "Floor Care Made Easy" is 
Kent's signature and, with a focus on 
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robotics, the tag, “The Future Is Here" has 
been added. 

Electrolux - An Historical Perspective 

AB Electrolux, a Swedish company, is 
one of the world's leading producers of 
household appliances. It leads the 
European market and, as owner of 
Frigidaire, is the third largest producer in the 
us. The group is also the world's largest 
company in floor care products and 
absorption refrigerators for recreational 
vehicles and hotel/motel rooms. Outside of 
household appliances, the Electrolux group 
is one of the world's top two companies in 
the areas of food service equipment, 
industrial laundry equipment, forestry and 
garden equipment, refrigerator 
compressors, and car safety belts. 

The Electrolux group has 
approximately 600 operating companies in 
more than fifty countries and, since the 
eighties, has been concentrating on 
investments in new product design to carry it 
into the future. 

Robotic Development 

Early in 1 986, Electrolux became 
interested in the possibility of robotizing 
several different types of products. 

Electrolux had previously been involved with 
Unimation, a company founded to focus on 
the development of industrial robotics. 
Joseph Engelberger, considered to be the 
father of industrial robotics, founded 
Unimation in the late sixties and sold it to 
White Westinghouse in the early 1 980s. 
When Electrolux contacted Engelberger in 
1986, he had recently founded Transitions 
Research Corporation, a start-up company 
geared towards the development of mobile 
robotics. Engelberger's background also 
included the successful design of assembly 
robots for the country's major automotive 
manufacturers. 


Electrolux asked TRC to look into the 
feasibility of robotizing iawn mowers, fioor 
care equipment, and a househoid robot 
similar to “Rosie" on the TV cartoon, "The 
Jetsons." Electroiux envisioned robotic 
equipment of all types as the future in 
manufacturing and productivity. They knew 
it would hctppen, but they just didn't know 
how soon. What Eiectroiux did know was 
that it wanted to be the company known as 
the leader in robotic technoiogicai 
development and manufacturing. 

It was decided to first concentrate on 
developing a robotic fioor scrubber. 
Electrolux was so convinced that the future 
was in robotics that they sent two engineers 
from Sweden for a two year period to do 
technology transfer back to Stockholm. 

The First Prototypes 

When Eiectroiux decided that the fioor 
scrubber wouid be the first robotic 
experiment, they directed The Kent 
Company to send TRC a standard 32" 
autoscrubber. 

When the machine arrived at TRC, 
there were many meetings to determine 
where the drive motors wouid go, where the 
electronics would be mounted, how the 
machine wouid turn on a soapy fioor, how to 
have the brush head and the squeegee go 
up and down, and how to design the 
machine for use in manuai mode as weii as 
in automatic mode. And those weren't the 
only major hurdles. The machine had to 
cope with people and other obstacles and, 
most importantly, still be able to do an 
excellent cleaning job. A robotic scrubber 
that moves around on the fioor aii by itself 
isn't any good if it doesn't ciean as weii as 
the traditionai equipment. 

Several months of engineering resuited 
in the birth of "Big Bertha." This machine 
marked the first attempt to robotize a piece 
of floor care equipment, initiaiiy tested in a 
local school scrubbing fioors, this first 
prototype did indeed prove Electroiux's 
vision that floor care equipment could be 
robotized. After more preiiminary testing, 
"Big Bertha" was retired. Eiectroiux then 
decided that it was important to get 
marketing input on what a robotic scrubber 
should do. It was decided that the first 
autoscrubber, "Big Bertha," was too big to 


transport around the country for field tests. 
Therefore Kent sent TRC smaller 
autoscrubbers to be transformed into robots 
so that it would be easier to transport and 
test them in real life situations. TRC began 
robotizing three Kent 20" autoscrubbers 
early in 1988. These machines had 11- 
gallon solution and recovery tanks making 
them much lighter than the 32" machine with 
two 27-gallon tanks. 

The first 20“ autoscrubber was 
robotized and configured several different 
ways. Electronics were first installed on the 
side of the machine. It soon became 
apparent that that location was susceptible 
to damage on turns or when something 
bumped the side of the machine. Next the 
electronic components were mounted on the 
top of the machine. That wasn't feasible 
because it was too easy, while filling the 
tanks from the top, for overflow to run 
directly into the electronics box. Next we 
mounted the electronics on the front of the 
machine and found out that it was too 
difficult to protect them from bumps head- 
on. We finally decided that the most logical 
place for the majority of the electronic 
components was on the rear of the machine. 
Installing the electronics on the rear of the 
machine also made it very convenient for 
them to be serviced. The rear placement of 
the electronics is still the optimum choice. 

We also went through several stages of 
deciding what the main obstacle sensors 
would be. An early prototype had a scanner 
mounted on the front of the machine. That 
didn't provide enough information and it was 
extremely susceptible to breakage. 

Early on we mounted sensors on the 
right side of the brush head so that the 
machine could scrub right up to the wall. 

First we mounted a paddle that rubbed right 
against the wall, but end users didn't like the 
fact that the paddle sometimes marked the 
wall. We then went to a thick foam on the 
brush head that was sensitive and cleaned 
right up to the wall. That scuffed the wall 
too, so that idea was discarded. Our final 
wall-tracking device was a non-contact 
sonar sensor, similar to what is used in a 
consumer camera to measure distance. 

At this point we had to face the fact that 
it would be best to design the cleaning 
equipment around the robotics (navigation) 


rather than robotizing an existing piece of 
equipment. It's much better and easier to 
add the cleaning device once the vehicle 
navigates properly. Since TRC didn't 
specialize in cleaning, however, a 
completed vehicle had to be used to 
develop those early prototypes. Three 
prototypes were ready for alpha testing in 
early 1989. 

Late Night Testing 

At that point, Kent assigned a sales 
manager from the East Coast to monitor the 
progress of the robotic development and 
provide input on real life scenarios in the 
cleaning industry. 

Those first tests launched a series of 
incidents that would fill a book of bloopers. 
We chose a local grocery store in Danbury, 
Connecticut, as the first testing site. Late 
night tests with this space-age robot caused 
quite a bit of excitement. We used a 24- 
hour grocery store so that we could do our 
tests in the early morning hours when fewer 
people were shopping. In one of the very 
first trials with the robot going down the 
aisle, a little grandmotherly-looking lady 
walked by the other end of the aisle with her 
grocery cart. Here we are at 1 :00 a.m. in the 
morning with a piece of equipment that 
appears to be straight out of a science 
fiction movie. The little old lady pushing her 
cart across the end of the aisle notices the 
robot out of the corner of her eye, stops, 
turns towards the robot, and sees it coming 
straight at her. Then she looks up towards 
the other end of the aisle and sees fifteen 
people, gasping for air, hoping that she will 
move out of the way and let the robot pass. 
Instead she froze in place. I started 
sprinting down the aisle knowing we were 
going to have an emergency medical 
situation, that the robot would hit her. I just 
didn't know whether she'd have a heart 
attack first or be run over. Lo and behold, 
the robot planned its path right around her 
and continued cleaning the rest of the area 
without a mishap. We knew at that point we 
were definitely headed in the right direction. 

A trip or two later to the same grocery 
store we were a little less lucky. As the robot 
was doing its last cleaning pass, it 
encountered a free-standing display of soup 
cans set up at the end of an aisle as a 


"special of the week.” After our last 
experience, we didn't doubt for a minute that 
the robot would move around the display. 
That's when the corner of the brush head 
caught the bottom of the display and sent 
400 soup cans rolling around the floor. 

Today we laugh. At the time, the store 
manager, Kent's salesman/consultant and I 
didn’t think it was the least bit funny. Today 
we are very close to introducing robotic 
cleaning equipment for supermarkets based 
on some of those early tests. 

Following those first several months of 
testing it was time to move on to the next 
step - limited production. 

Making The Bio Move 

TRC was a tremendous help in 
developing the initial three prototypes, but 
Kent decided to develop a robotics 
department at its Elkhart location so that 
development and floor care equipment 
manufacturing would be at the same site. It 
seemed wiser to hire someone from TRC 
who was familiar with the project than to try 
to find someone new. New Years Day is 
always the time for new beginnings, so it 
seemed appropriate that on January 1, 

1991 , 1 moved to Elkhart to develop Kent's 
robotics program. TRC was pleased that 
someone who had worked on the project 
since its inception would continue because it 
made for excellent communication and 
interface from manufacturing to TRC. 

Kent’s Commitment To Robotics 

The Kent Company shared the parent 
company's vision of robotics as the way to 
increase productivity in floor care. Although 
TRC would still work with us, it soon 
became apparent that we would have to 
develop a fully staffed department. The first 
hire was a software engineer. Since then 
our group has grown to a staff of seven full- 
time people dedicated solely to robotics 
development. Since Kent is an original 
equipment manufacturer of industrial 
cleaning equipment, it was necessary to hire 
the majority of the robotics staff from outside 
the company to acquire the talent we 
needed. The staff consists of technicians, 
software engineers, mechanical engineers, 
draftsmen, and electrical engineers. 


Kent's Robotic Learning Curve 

We learned invaluable lessons from the 
many mistakes and missteps that always 
result from a venture into the unknown. In 
the process of training a new software 
engineer, I told him to put the first prototype 
on a lift table. It was necessary to do some 
repair on the navigation system which had 
developed an intermittent connection 
probiem. Although not completely familiar 
with the machine, he lowered the lift table, 
drove the robot up the ramp, and raised the 
lift table to its full five feet so we could do the 
work. While I took a phone call, he studied 
the electronics and patiently waited to get 
his hands on the machine for the first time . 
When I hung up the phone, he told me he 
thought he had discovered the problem. Not 
knowing that the machine was stiii on, I said, 
"great." What he didn't know was that 
disabling or unplugging the handle bar 
control at that time made the machine run 
full speed in reverse. As he leaned fomvard 
to show me the loose wire causing the 
problem, I panicked. Before I could warn 
him, he reached into the controls and 
touched the loose wire. It broke completely, 
and it was the problem all right. The 
machine went full speed in reverse, he on 
one side, me on the other, and the machine 
five feet up in the air. We tried to hold it but 
couldn't. The machine landed on its back 
full of water and batteries. We both sat 
staring at the only prototype machine we 
had, now a $50,000 pile of components, and 
wondered whether we would still have jobs 
the next day. With the machine in ruins, 
everyone in the group pitched in for a 
continuous eighteen-hour repair job. From 
that we learned to design the machine so 
that when the handlebar is unplugged, the 
machine will not move. Many of the built-in 
safety features actually came from real life 
experiences making today's RoboKent a 
real user-friendly robot. 

Marriott's Involvement 

With a robotics staff in place and 
development well on its way, it was time to 
think about selling robots. Electrolux had 
been talking with the Marriott Corporation 
about an exclusive agreement, a joint 
venture situation, in which Marriott would 
share in development costs as well as buy 


all of our prototypes and production for a 
period of two years. While Marriott is 
usually thought of in connection with 
lodging, that was only one of their smaller 
business segments. They are also involved 
in airport food services and managing 
cleaning facilities in hospitals, office 
buildings, and school districts. Once an 
agreement was reached in February, 1991 , 
we started installing prototypes. We looked 
for local, visible sites. Marriott decided to 
start in Chicago where they had two big 
hospital accounts. 

First Prototypes Installed At The End-User 

The first machine was installed at 
Northwestern University in Chicago. As far 
as we know, this was the first commercial 
robotic floor care equipment actually sold 
and in place at an end-user location. All of 
the Marriott management team from the 
main office in Washington, DC, was there 
for what was to be history in the making. 

This was to be another learning 
experience. As we unveiied the RoboKent 
scrubber for all of Marriott and the hospital 
personnel, you could feel the excitement 
permeate the air. Dozens of people 
watched as the robot went up and down the 
basement aisle scrubbing, cleaning, and 
drying in one pass. In fact, everyone stood 
quietly in awe as the RoboKent scrubbed 
the floor perfectly. On its final cleaning 
pass, a janitor, caught unaware, happened 
to walk by at the opposite end of the hall. 
He looked up in shock at the machine 
moving by itself and then he saw his boss 
standing with me on the far end of the 
corridor. He ran towards the machine 
shouting at the top of his lungs, 'Til get that 
run-away machine, Richard!". His boss 
yelled down the corridor, "Leroy, leave that 
robot alone," and Leroy responded, "What 
robot?" We laughed but this incident was 
repeated several times with other 
employees in other locations. That taught 
us to be very aware of the impact of robotic 
equipment in action on people and to be 
prepared for unusual reactions. ..something 
that we still caution people to consider and 
prepare for today. 

After we installed Northwestern's 
machine and a machine at The Rush 


Presbyterian Hospital in Chicago, we 
recognized some limitations. 

The Chicago machines were hand built 
as prototypes, not necessarily machines to 
be left in the hands of an end-user. After a 
few trips to Chicago to repair loose 
electronic connections as well as to do 
repair on safety sensors, we knew further 
development was imperative. 

Programming the machine was the 
next immediate challenge. When we first 
installed the machines, they were 
programmed by walking them down the 
hallways and entering data into their on- 
board computers as we went. That was 
great the first time we went to the hospitals. 
We programmed the several hallways that 
the customer indicated and left. Two weeks 
later the customer called back and reported 
that the robot was running fine; please come 
program the other thirteen fioors of 
hallways. That task in itself took MANY 
days. We realized we couldn't go into 
production if we had to travel to every end- 
user location to program a robot for each 
and every hallway. Besides the fact that the 
initial installation could take several days, 
hallways always seemed to be under 
reconstruction or reconfiguration. That 
resulted in requests for us to come and 
reprogram those sections. There were too 
many limitations to list, but the reality was, 
manually teaching production robots for an 
industry where floor patterns can change 
from day to day was not going to work. It's 
not like an industrial robot where it's 
programmed once and the robot will perform 
the same task time and time again in an 
unchanging environment. In addition, 
installation costs could surpass the actual 
cost of the machine. This forced us to look 
into other programming methods and from 
that sprang the "Auto Learning" process, 
patent pending. 

Patent Pending "Auto Learning " 

The refinement of "Auto Learning" was 
the first major engineering victory for Kent’s 
robotics department. I felt that if we could 
manually walk a machine to teach it and 
enter data into the robot's computer, the 
robot should be able to teach itself while it 
traveled the hallways, entering data into its 
own computer. I was told I was crazy by 


several well-respected people in the mobile 
robotics field. I saw then that this task was 
going to be all ours; no one else believed in 
it. Being told that a robot couldn't teach 
itself made me more determined than ever 
to make it happen. Two and a half months 
later we had the first prototype running in 
our warehouse in Elkhart. 

That was just one of many major 
challenges that brought vigor and vitality to 
Kent's robotics department. It also brought 
Kent to the forefront of robotics technology. 
Now, knowing that we could install robotic 
scrubbers at an end-user's site without 
being required to go back to program the 
machines, we were ready to do a limited 
production run of prototypes, the third 
generation. 

With the old-style programming, once 
hallways were programmed, the operator 
had to remember which program 
corresponded to what area. The operator 
needed to either remember hundreds of 
hallway numbers or he had to have some 
sort of chart mounted on the machine. 

Once we updated the software in the first 
two Chicago sites, the operators' gratitude 
knew no bounds. Unlike those who get a 
robot today, they really knew the old 
fashioned way. 

After three months of testing with “Auto 
Learning” installed, Marriott decided they 
wanted ten more machines to spread 
around the country in various geographical 
locations and situations. 

These ten machines were put into 
operation in late 1 992 with the idea that we 
would get direct feedback from Marriott 
about how the machines worked in real-life 
environments, if they were user friendly, 
and, most importantly, their productivity 
quotients. The goal was to understand the 
expectations of end users and how, if at all, 
the next generation machines couid be 
made more productive. 

Prototype Testing Complete 

After eighteen months of testing on the 
second and third generation machines, it 
was time to evaluate all input from the field 
in conjunction with the engineering ides that 
we had compiled. 

We learned that a machine with an 1 1- 
gallon solution tank and an 1 1 -gallon 


recovery tank needed to be serviced 
approximately every thirty-five minutes. 

That certainly didn't increase productivity. 

We also found out that after four hours of 
run time the batteries needed to be 
recharged. That meant that the robot sat 
idle for up to nine hours during the recharge 
time. That didn't increase productivity 
either. The chemicals put in the solution 
tank couldn't be flushed out because there 
were no dump fittings on the solution tank. 
Again it was apparent that we needed to first 
develop the robotics and then add cleaning 
components. One big complaint was that 
the machines really looked like we took an 
existing autoscrubber and hung electronics 
on it. It was an unwieldy sight and not very 
eye appealing. Those ideas and views 
came from end users. Now it was time for 
the input from the robotic team's 
perspective. 

We decided we needed a better design 
to make servicing the electronics easier. 

The original twelve machines, still in 
operation today, are serviced by our in- 
house technicians. Larger production runs 
would require making the electronic 
components more accessible and fail-safe 
for field service people to replace. 

Generation Four 

Following two months of information 
compilation, we were ready to start 
redesigning. It was finally time to engineer a 
machine from the ground up to become a 
totally robotic scrubber, not take a scrubber 
and make it into a robot. 

We designed larger tanks, developed a 
slide-out exchangeable battery pack, and 
incorporated means for dumping both the 
solution and recovery tanks. 

Production Set-Up 

Building a robot in the laboratory was 
one thing. Building a robot in the 
manufacturing area was something else 
altogether. We selected three people out of 
our factory to be dedicated to the 
manufacturing of robotic equipment. We 
also had to set up a custom line conducive 
to the safety of handling electronics. We 
went into full line production in January of 
1992. Although it took a few months, our 
production line now runs without the 


supervision of the robotics group. Factory 
personnel tell us that building the robot is 
just like building any other piece of 
equipment. 

To date, we've assembled more than 
eighty robotic scrubbers on that assembly 
line. That line was also designed to support 
the manufacturing of our robotic power 
sweepers, robotic burnishers, and other 
robotic floor care equipment now in 
development and testing stages. 

A secondary benefit of doing 
development at the manufacturing location 
is that there is plenty of room to test the 
robots after they've been assembled. 
Additionally, all robotic floor care equipment 
is tested for its quality of cleaning by factory 
personnel who are thoroughly 
knowledgeable about the cleaning industry's 
requirements and concerns. 

Conclusion 

Making floor care easier and more 
productive has long been The Kent 
Company's mission. To achieve that 
objective, we are devoted to staying at the 
forefront of technoiogy by investing in long- 
range development. Productivity is 
accomplished by making the installation and 
use of robotic equipment easy for 
maintenance personnel who do not have 
engineering degrees at a cost that will 
generate a high return on investment in a 
short time span. With costs being squeezed 
from budgets and fewer personnel being 
asked to do more. The Kent Company 
believes that the development of robotic 
technology and equipment in the US will 
have dual benefits. More manufacturing 
jobs will be created and the amount of work 
maintenance personnel can accomplish 
within allotted time frames will be increased. 

We believe that the future is robotics 
and that "The Future Is Here." 
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Abstract 

Non-geometric hazards (i.e., those which cannot 
be characterized soley by their shape, but 
instead are related to mechanical properties 
such as strength and friction) may pose a 
significant risk to planetary rovers. This paper 
describes a means for an articulated vehicle to 
detect sinkage and slippage in such material so 
as to prevent entrapment and to correct for 
dead-reckoning errors. Simulation results and 
preliminary indications of test data are 
described. 

Introduction 

For an exploring vehicle to move safely over the 
surface of another planet, it is potentially 
important to know if the vehicle is sinking into 
very soft surface material or is experiencing 
high levels of wheel slip. For example, at the 
Viking 1 landing site, about 14% of the surface 
is drift material, and one of the landing legs 
sank 17 cm into that material.’ Previous 
studies of non-geometric hazard detection for 
planetary rovers, which assumed very large 
(-1000 Kg) rovers, have focussed on Ground 
Penetrating Radar to detect subsurface 
hazards. However, mass and power 
constraints for microrover missions lead us to 
desire means to detect these hazards without 
requiring additional mass, power, or complexity 
beyond the basic vehicle configuration. 

For the purposes of this discussion, we consider 
the mission model of NASA’s Mars Environmental 
Survey (MESUR) Pathfinder project, scheduled 
for launch to Mars in November, 1996. In this 
mission, a microrover with a mass of under 10 
Kg will traverse over the terrain within a few 
tens or hundreds of meters from it's lander to 
goal points selected by ground-based analysis of 
images taken by lander stereo cameras. These 
goal points are selected for their scientific 
interest, and it is important that they be 
approached quite accurately (for example to take 


a spectrum of a particular rock), t hus safety 
and improvement in the accuracy of 
dead-reckoning navigation are important reasons 
to develop a reliable means for estimating the 
sinkage and slippage of the rover wheels. Means 
for detection and avoiding geometric hazards are 
described elsewhere.^ 

The Pathfinder rover is a six-wheeled 
‘rocker-bogie* articulated vehicle. It will be 
functionally equivalent and the same size as our 
research vehicle ‘Rocky 3.2‘, shown in Figure 1. 



Figure 1 . Rocky 3.2 vehicle 
with laser stripe sensors 


Rocky 3.2 (one of a long line of Rockys) has 
sensors for wheel speed (all wheels are driven) 
and for determining the articulation angles of 
the chassis. (The articulations are passive so 
that each wheel follows the terrain contours 
independently.) It also has a look-ahead ranging 
sensor based on detecting, in a CCD image, the 
position of laser stripes projected ahead of the 
vehicle (Figure 1 is reproduced from a color 
original with a blue filter so the red laser 
stripes show as dark lines). Because the 
computation on-board the rover is very limited 
(an 8085 CPU, about 20 times less powerful than 
a typical personal computer), it is important 
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that only a small amount of sensor data be taken 
and processed. Thus it is important to 

formulate simple algorithms for estimation of 
slippage and sinkage, and to do performance 
evaluation based on the concept that only the 
wheel, chassis, and a minimum number of 
discrete measurements from the look-ahead 
sensor can be used as input to the system. If a 
simple algorithm gives good performance, in 
terms of improvement of dead reckoning vs 
basic odometry and in detection of hazardous 
sinkage conditions, then the increased 
computational load will be justified. 

The Sinkage and Slippage Model 

We consider a planar model as shown in Figure 2. 
Specifically, there are three wheels connected 
with passive but instrumented linkages so that 
they remain in contact with the soil as they roll. 
By processing the pitch and articulation sensor 
values we can compute the difference in 
eievation between the rear wheel and the center 
or front wheels (call these z(1)and z{2), 
respectively). We also have a look-ahead 
ranging sensor which examines a number of 
discrete points on the ground ahead of the 
vehicle. Again, by processing the sensor data, 
we can compute the elevation difference 
between the rear-wheel nominal contact point 
and the elevation of each sensed point on the 
ground ahead of the vehicle (call these z(3)... 
z(N)). Needless to say, all these measurements 
have noise which must be accounted for in the 
analysis. 

Assumptions 

We assume that undisturbed terrain in this 
planar model has an elevation function y(x), 
where y is the elevation at a point x along the 
horizontal axis. When the vehicle moves ahead, 
the front wheel sinks in the soil by an amount 
s(x), so that it rolls along in contact with the 
function y(x)-s(x). We assume that the trailing 
wheels do not further compress the soil (since 
the wheel loading of this vehicle is roughly 
uniform). Thus they also track y(x)-s(x). This is 
a key assumption which, if not approximately 
correct, will lead to a general failure of the 
entire approach. If the wheels all turn at the 
same rate (which is reasonable since they are 
geared so low that in normal terrain they run 
effectively at the no-load speed), then when the 
wheel circumference has moved a distance w the 
vehicle will advance some distance x in the 


horizontal direction, usually less than w, due to 
wheel slippage. This slippage will generally be 
a function of the type and slope of the soil. 


y 

u. 



Figure 2. Pianar modei and symbol definitions 
Objective 

The objective of this analysis is to estimate x 
and s(x) given the "odometer" reading w, the 
values of z(1)...z(N), and the associated 
measurement noise v(1)...v(N). Intuitively this 
should be possible, since if y(x) and s(x) were 
known exactly up to the forward-most sensor (a 
ranging sensor for y and the front wheel for s), 
then for a given Aw, there would usually be a 
unique Ax which would allow all the sensor 
readings to match their predicted values. In 
other words one would "slide" the rear and 
center wheels along the curve y(x)-s(x) until the 
observed elevation difference z(1) is matched 
between x+Ax (the new position of the rear 
wheel) and x+Ax+d(1) (the new position of the 
center wheel), which would fix Ax. Then one 
would use the measured elevation of the front 
wheel to compute y-s at that point (thereby 
extending our knowledge of s(x) forward by Ax. 
Similarly, we would use the measured elevation 
at the forwardmost range sensor to extend our 
knowledge of y(x) by Ax. This process would 
repeat so as to build an arbitrary sequence of 
Ax, s(x+d(2)), and y(x+d(N)) values. We would, of 
course, assume a yo(Xo) value as the starting 
elevation and position of the rear wheel. 
(Knowledge of the initial y(x) and s(x) functions 
between x and x + d(N) is trivial since the 
vehicle will disembark from the lander along a 
ramp of known geometry and with negligible slip 
and sinkage.) 
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One potential problem with this approach is that 
values of z(1)...z(N) will not be taken densely 
along the vehicle trajectory. Actually, the 
processor on the vehicle is sufficiently slow and 
burdened with other activities that the 
navigation and mobility sensors are only 
monitored roughly every wheel radius of 
forward traverse. This is often enough to ensure 
that rocks, craters and other geometric hazards 
can be detected and avoided (an issue not 
addressed in this paper). 

There are several issues which need to be 
considered with this model. First, if 

Z(k)=col(z(N) z(1)) at cycle k is measured on 

terrain which is very flat (compared to the 
measurement uncertainties V(k)) then we would 
still like to have a reasonable estimate of 
forward travel. This suggests that we should 
have a prior model of the distribution of the slip 
x(w), and that we should form a Maximum A 
Posteriori (MAP) estimate of the slip.® 

Following our heuristic argument above, if we 
were to "slide" the vehicle along until the 
observed elevation difference z(i) is matched, 
this corresponds to generating a discrete set of 
values y(i), i=0,...M1 and s(i), i=1,...M2 which can 
be thought of as our best estimate "histograms" 
(i.e., discretized piecewise constant 
representations) of the y(x) and s(x) functions. 
The horizontal density of these estimates should 
be sufficiently great to allow accurate models 
of the terrain for purposes of simulation, but 
not so great as to unduly burden the processor. 
Since the wheels mechanically average the 
terrain over a length equal to the tire contact 
patch (about a third of a wheel radius) we would 
tend to discretize the model at about this level. 
Thus we might have M2=30 or so and M1=60 or so 
(the actual Rocky 3.2 vehicle has 13 cm dia. 
wheels and an overall length of 60 cm, with the 
look-ahead sensor reaching about one vehicle 
length). 

Thus we can now outline a procedure for 
estimating the sinkage and slippage of the 
microrover: 

1) Measure the elevation differences z(1)...z(N). 

2) Use previously-estimated (described below) 

histograms y(i), i=0 Ml and s(i), i=0,...,M2, as 

well as a Gaussian prior distribution for Ax with 


mean m^ and variance compute the 

(nonlinear) MAP estimate for Ax. We assume the 
distribution for measurement noise for each z(i) 
is also independent and Gaussian. Since the MAP 
estimate of independent Gaussians is a weighted 
least-squares estimate, we compute: 

mini(j, 3 X^-'[( 1 /o,(j) 2 )(zG) - y(dO)+i) - 
y(i)+s(i))2] + ( 1 /o,(i, 2 )(z(i) - y(d(1)+i) + 

s(d(1)+i) - y(i) + s(i))2 + ( 1 /Ox 2 )(i-mj 2 ^ 

The interpretation of this expression is as 
follows: to maximize the posterior probability, 
which is the product of exponentials, we need to 
minimize the magnitude of the exponent. If we 
let i be the histogram bin which we assume the 
rear wheel has advanced to (and changed to an 
elevation y(i)-s(i)), then the summation from 
1=3 to N-1 is of squared errors between the 
ranging sensor elevations and the corresponding 
y values in the histogram. The next term is the 
weighted squared error for the middle wheel, 
incorporating the histogram data for s(1) as 
well as y(1). The last term is from the Bayesian 
prior distribution. Note that z(2) does not even 
appear in this expression, as the advance of the 
front wheel involves an unknown amount of 
sinkage in the soil and so there is no histogram 
data with which to compare. A similar situation 
arises with z(N) in the summation, since y(x) is 
unknown ahead of the forwardmost sensed point. 

We implicitly assume that the forward advance 
is not so great as to push the next sensed point 
z(N-1) off the end of the histogram, although 
this could be accounted for if necessary. We 
would then perform a parabolic interpolation of 
the weighted-sum-of-squares to get a refined 
estimate of Ax to a fraction of a histogram bin. 
While not strictly valid, interpolation of the 
error function should be better than taking 
integer bins, while not as computationally 
intensive as the more conceptually-correct 
approach of computing the minimal error 
function on interpolated data. Note also that we 
could compute m^ as a function of the data here 
prior to finding the minimum over i to account 
for the fact that our expected slip is a function 
of average terrain slope. For example, we could 
compute 


(,^,Z^z(j)/dG)))/N 


as an estimate of the slope and compute some 
linear or non-linear function of this to compute 
m^. We could also modify the estimates of m^ 

and 0^2 using prior estimates to adapt and refine 
our Bayesian prior. 

3) Now that we have an estimate for Ax, we 
translate the histograms for y and s forward by 
Ax and up by y(Ax). This requires interpolation, 
due to the non-integer nature of Ax, so we 
assume that linear interpolation between 
adjacent points is adequate (again to reduce 
computational complexity). We extend our 
knowledge of y forward by linear interpolation 
from the translated old y(N) value to the 
observed z(N) at d(N). Similarly, we extend our 
knowledge of s forward using linear 
interpolation from the translated s(d(2)) to a 
new forwardmost value s(d(2))=y(d(2))-z(2). 

4) We need some way to incorporate the new 

measurements into the histogram for y 
(otherwise only the forwardmost measurement 
y(N) will play a role in defining the function, 
which seems to waste a great deal of valuable 
information). Note that between the old d(N) and 
the new d(N) we have a linear approximation to 
y(x). When the vehicle moves forward by Ax 
(generally less than d(N)-d(N-1)), we will get a 
new value for y from z(N-1) which will, in 
general, not lie on the previous linear 
approximation to y. Since we expect that our 
measurement noise will be quite small 

compared to the grossness of the linear 
interpolation, we would like to force the 
histogram to conform to the data at this point 
(the new d(N-1) point). We would also expect 
y(x) to be a continuous function, so that nearby 
points should also be modified. For simplicity, 
we will assume that adjacent histogram bins 
will be updated by "splitting the difference", i.e. 
they will be reassigned values halfway between 
the new measurements of y based on each of the 
z(i) measurements for i<N and the old (but 
translated) histogram value. This is an ad-hoc 
assumption made in the interests of 
computational simplicity which will hopefully 
allow a fairly accurate estimate of y(x) to be 
generated as all of the sensors sweep over the 
surface. We can perform a corresponding 
process for s(x) by assuming that deviations 
between z(1) and y(d(1))-s(d(1)) are due to 
errors in the measurement of s and not y, which 
makes some sense because by this time the 


histogram for y has been refined with multiple 
measurements while the histogram for s has 
been generated only by piecewise linear 
interpolation out to the single measurement at 
z(2) (i.e. the front wheel). 

5) Lastly, move the vehicle forward and repeat 
the cycle. 

This model and analysis are very simple and 
somewhat suspect from a theoretical 
point-of-view. However, as in many practical 
applications, real-time performance and 
computational complexity are of paramount 
importance, with the alternative being not to do 
any estimation at all. Thus we would like to 
know what the performance of this simple 
estimation procedure is, and to what degree it 
gives improvement over use of the prior mean 
m^ to estimate over-the-ground distance 
travelled and not estimating sinkage at all (and 
accepting the risk of getting stuck). We would 
also like the evaluate the usefulness of having 
more ranging sensor measurements as opposed 
to fewer, since each additional measurement has 
cost and may only be needed for this purpose (as 
rocks and craters may be detectable with as few 
as two look-ahead range points). If possible, we 
would like to also have a way of choosing the 
distances d(3)...d(N). 

Thus what remains to be done is 1) perform an 
evaluation of the performance of the system by 
estimating the variance in the slippage and 
sinkage estimates by Monte Carlo numerical 
simulation (since the nonlinear MAP formulation 
is intrinsically iterative and because we want 
to explicitly incorporate the effects of 
quantization into the histogram bins, the effects 
of resampling and interpolation, etc.). This 
simulation will evaluate the effects when the 
data are not drawn from a Gaussian distribution, 
such as a uniform distribution of equal or 
different mean. Lastly, we would like to 
evaluate the effect on performance of varying 
the number of sensed values N, of modifying the 
mean m,, of the prior slip distribution based on 
experience, and of changes in the sensor noise 
0 ^( 1 ), which we might adjust in an ad-hoc way to 
account for the aliasing which the point-range 
measurements will have in estimating the 
average elevation over the histogram bins, 
where the spectrum of y(x) might grossly 
violate the Nyquist sampling theorem when 
binned in this manner. 



The assumed model for y(x) in the simulation 
needs to be chosen with some care. A scale- 
invariant (fractal) model is attractive, but we 
need to recognize that the hazard-detection and 
avoidance system will effectively clip the 
distribution of terrain features at some 
particular scale. Similarly, a model for s(x) 

needs to be formulated, which will be slowly 
varying and of low amplitude. It would be good 
to assess the performance of the system when 
the slippage and sinkage are correlated, as one 
would intuitively expect, even though the model 
does not incorporate that effect (although it 
easily could). Another interesting correlation 
which would be good to model in the simulation 
is the fact that the mechanical linkages in the 
vehicle chassis cause the noise in the 
measurements z(i) to be highly correlated (since 
wheel pairs are at opposite ends of links), even 
though they may be Gaussian (from digitizing 
analog potentiometer values or peak detection in 
analog CCD scan lines). 

The simulation model for y(x) and s(xt 

As mentioned above, we desire to test the 
sinkage and slippage estimation algorithm on 
terrain which is 'scale invariant". Specifically, 
we wish to create a sample random terrain in 
the form of a histogram (i.e. sequence) at the 
same resolution as that maintained by the 
estimation algorithm. This is accomplished by 
uniformly sampling a linear combination of sine 
waves, whose amplitude is random over a 
uniform range extending from zero to some fixed 
multiple of the wavelength (thereby ensuring 
scale invariance), and whose phase is random 
over [0,2rt]. Twenty different wavelengths are 
combined over the range from 1 cm to 1.9 
meters, with each one 30% longer than the 
previous one. This range encompasses all scales 
of interest: smaller scales average to zero over 
the bins and longer scales are virtually flat over 
the length of the vehicle and its look-ahead 
ranging sensor. (Note that the smaller scales 
will exhibit substantial aliasing when binned, 
which is an important and real effect that needs 
to be modelled by the analysis.) As mentioned 
before, a "smooth" simulated terrain is realistic 
here, since the geometric hazard detection 
system will avoid rough or discontinuous 
terrain. 

The terrain we construct here is characterized 
by a single parameter: the maximum slope of 


each sine wave component. We call this 
parameter the "roughness" of the terrain. Both 
y(x) and s(x) are created by this technique, but 
s(x) is clipped at zero so that only positive 
values of sinkage are allowed. The "estimated" 
histograms of y and s are initialized with the 
"actuai" values from this simulation: from that 
point on the estimation procedure extends them. 
This is reasonable since, as mentioned above, 
the first meter or so of traverse will be on the 
lander exit ramp and therefore known. We 
arbitrarily set the roughness of the sinkage 
function s(x) to be 20% that of y(x), based on the 
philosophy that the terrain mechanical 
characteristics are more slowly-varying than 
the surface topography. 

It is perhaps worth mentioning that the approach 
of combining sine waves over a large number of 
different scales is computationally intensive, 
but need only be done once to simulate a large 
number of different terrain types, since to 
change the "roughness" only requires rescaling 
the vertical coordinate of a "standard" terrain, 
i.e. with unity roughness. Another approach to 
generating scale-invariant terrain, the use of 
Gauss-Markov random sequences, needs to be 
fairly high-order to get the needed range of 
scales and thus becomes extremely complex to 
analyze. 

Specifics parameters for initializing the model 
are drawn from the actual design of the Rocky 
3.2 microrover. Thus, for example, the distances 
from the sensed points to the rear wheel contact 
point are 25, 50, 60, and 80 cm for the middle 
wheel, front wheel, downlooking range sensor, 
and outlooking range sensor, respectively. The 
sensor noise (standard deviations) associated 
with these elevation differences are 0.04 mm 
for the wheel sensors, and 2 mm for the 
look-ahead sensors. We normally expect the 
vehicle to advance about 5 cm in each sensing 
cycle. 

Simulation Trials 

For each trial run, we evaluate the odometry 
error and sinkage error as a function of bin size 
and terrain roughness for different input 
assumptions. We evaluate bin sizes from 0.2 cm 
to 8 cm, which spans the range from very fine to 
very coarse compared the the expected forward 
advance per cycle. We evaluate terrain 
roughness ranging from a maximum slope at each 
scale of 0.25% to 8%, which spans terrain from 


very smooth (with typical elevation differences 
of 2 mm over the length of the vehicle) to very 
rough (with 8 cm typical elevation differences 
over the length of the vehicle, about the limit 
which the hazard avoidance system would 
permit). The simulation covers 62.5 meters of 
simulated terrain (25,000 bins at the finest bin 
size), which is created once and then resampled 
for the different simulations so that the effects 
of aliasing can be evaluated on identical terrain. 

One important issue not addressed in the 
previous description of the algorithm is the 
choice of the search range for the 
weighted-sum-of- squares (WSS). Initially, the 
search was extended to 4o + 4 bins beyond the 
Bayesian prior mean. However, it was found that 
the simulation would occasionally get “stuck" 
and fail to advance the rover by the proper 
amount for several cycles, whereupon the 
simulation lost track of the terrain (i.e. 
presumably the internal histogram for y(x) had 
no relation to the actual y(x)). This was caused 
by the global minima of the WSS function not 
corresponding to the actual forward advance. A 
simple fix for this problem was to compute the 
secondary minima, and if it was beyond the 
global minima and nearly as good (within a 
factor of 3), then the search range on the next 
cycle was extended to include that minima. Note 
that, in all cases, the global minima is chosen 
for the simulation, and only that the search 
range is extended if another minima shows 
promise, so that it can be selected as the global 
minima on the next cycle. This effectively cured 
the problem, and subsequently the simulation 
was not observed to lose track of terrain. 

Since we expect the look-ahead ranging sensor 
to have much worse measurement accuracy than 
the chassis articulation sensors, we will 
characterize the slippage estimation with the 
articulation-based elevation sensing noise, 
while the sinkage is based on the look-ahead 
sensor noise. 

Thus we represent the results of this analysis 
by plotting the sinkage or slippage error against 
the terrain roughness value. Typically we would 
expect to have little or no cumulative error 
when the terrain is very rough, and if the terrain 
is smooth the algorithm will just return the 
Bayesian prior mean value as the result, so the 
error that accumulates is just the difference 
between the Bayesian prior mean and the actual 


mean value. Thus for slippage, for example, if 
the Bayesian prior is in error by 20% (that is, 
the actual expected distance advanced per 
sensing cycle is different from the Bayesian 
prior mean by 20%), then we would expect the 
algorithm to smoothly transition from small 
error to 20% error in estimating traverse 
distance as the roughness is increased from zero 
to a large value. We wish to establish the 
nature of this curve for both slippage and 
sinkage. Furthermore, to reduce computational 
complexity, we wish to determine how coarse 
the histogram bin size can be without excessive 
degradation of these results. 

Table 1 shows the program output for the first 
test case, where the Bayesian prior 
overestimates the forward advance by 20%. 
Each entry in the table is the percentage 
odometry error over the 62.5 meter course, 
followed by the RMS sinkage error in parenthesis 
(in cm). Note that, indeed, the odometry error 
more-or-less smoothly falls from 20% for 
smooth terrain to near zero for rougher terrain. 
Furthermore, note that the performance 
improves as the bins get larger up to a point, and 
then declines for larger bin sizes, especially on 
rough terrain. 

The two effects which seem to be occurring are 
severe aliasing for large bins (when the bins are 
larger than the advance of the vehicle), and poor 
terrain modelling for small bins. The former 
effect is compounded by the fact that we cannot 
fit a parabola to the WSS function if the minima 
is at zero bins of advance, since we do not 
compute the function for negative advance and 
so cannot bound the integral minima with values 
on each side, as needed for a parabolic 
interpolation. In this case we merely set the 
forward advance estimate to exactly zero. For 
large bins (e.g. 8 cm when the expected forward 
advance is 5 cm) this occurs commonly, and is 
only sometimes compensated for in later cycles. 
This produces a strong tendency to 
underestimate the distance travelled. 

For very small bins, on the other hand, the 
algorithm we have selected for modelling the 
sparsely-sampled terrain is inadequate. 
Remember that we incorporate new z[i] 
measurements into the histogram by forcing the 
value at bin i to be consistent, and then "split 
the difference" on the i-1 and i+1 bins. When the 
bins are very fine this will produce narrow 
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SLIPPAGE ERROR AS A FUNCTION OF ROUGHNESS AND BIN SIZE 
(each entry percent odometry error, RMS sinkage error (cm)) 
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Simulation Parameters 

Actual mean advance per cycle: 5.0 cm. Sigma: 1.00 cm 

Bayesian prior mean advance per cycle: 6.0 cm. Sigma: 0.05 cm 

Unit-Roughness Terrain RMS Amplitude 1.11 meters over 62.5 meters 


Statistical Attributes of Unit-Roughness Simulated Terrain by Bin Size 
(each entry RMS bin-to-bin elevation change (cm) , 

RMS error in bin-to-bin linear projection (cm) ) 

Bin 


Size 

X 

(cm) 

X. 

,00 

X. 

25 

X, 

50 

X. 

75 

0.— 

0.00 

0.00 

2.40 

1.29 

4.62 

4.07 

6.58 

6.38 

1.— 

8.30 

7.47 

9.90 

7.85 

11.50 

8.12 

13.14 

8.74 

2.— 

14.81 

9.40 

16.50 

10.14 

18,18 

10.82 

19.86 

11.50 

3.— 

21.52 

12.51 

23.16 

13.81 

24.77 

14.78 

26.40 

15.74 

4.— 

27.87 

16.84 

29.73 

18.87 

31.38 

20.58 

33.06 

22.26 

5.— 

34.70 

24.27 

36.33 

26.42 

38.02 

28.56 

39.60 

30.45 

6.— 

41.24 

32.4 9 

42.69 

33.57 

44,22 

35.31 

45.69 

37.22 

7.— 

47.24 

39.24 

48.90 

41.42 

50.43 

43.60 

51.91 

45.52 


Table 1 


"spikes* in the histograms, and not at all 
correspond to realistic terrain. The proper fix 
for this would be to "remember" when and where 
each prior measurement was taken, and try to 
perform a statistically-vaiid terrain 
reconstruction (based on some assumed terrain 
Fourier spectrum), incorporating all prior 
measurements and their uncertainties. However, 
this would be computationally demanding, and 


the procedure we have adopted seems to work 
quite well for intermediate-sized bins, about 2 
cm long. 

Note that the sinkage estimates in Table 1 are 
all about the same for a given roughness, and 
increase more-or-less proportionally to 
roughness. This is intuitively pleasing, since 
the high accuracy of the wheel sensors compared 
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to the look-ahead sensors makes the estimate of 
the forward advance of the vehicle (i.e. the 
minima of the WSS function) almost entirely a 
function of the wheel sensors. Thus, the 
primary function being estimated accurately is 
the loadbearing surface y(x)-s(x), with both y 
and s being much more uncertain than their 
difference. Then sinkage is estimated using the 
look-ahead sensor(s), with their large attendent 
noise. This suggests that a more appropriate 
implimentation for the actual vehicle is to use 
the wheel sensors alone to estimate travel along 
the loadbearing surface, and to use only one 
look-ahead sensed value to estimate sinkage. 
Thus it is irrelevant to examine the case of 
additional look-ahead sensing values so long as 
their noise is very large compared to the chassis 
articulation sensing. The “roughness" scale used 
corresponds approximately to the RMS elevation 
differences in meters over the scale of the 
vehicle, i.e. a roughness of 0.08 gives 8 cm of 
typical elevation difference across the vehicle. 
At the bottom of Table 1 is a chart showing 
some of the statistical properties of the 
unit-roughness simulated terrain: the RMS 
bin-to-bin elevation change and the RMS error in 
a bin-to-bin linear projection to the next bin, 
each as a function of bin size. This table has cm 
of bin size along the left, with fractions of a cm 
along the top. Note that the values for zero bin 
size, which in fact don't exist, are set to zero 
for printing purposes. 


There is one striking fact represented in Table 
1: we have selected the standard deviation of 
the Bayesian prior to be 0.05 cm (1% of the 
actual advance), when the sigma of the actual 
vehicle advance per cycle is 1 cm. This 
artificially "overweights" the Bayesian prior to 
show the smooth transition from 20% error to 
small error as the terrain gets rougher. 
However, the chassis articulation sensors are so 
accurate (o=0.04 mm) that we can do much 
better than this. Figure 3 shows the results for 
different values of the Bayesian prior (1% and 
10% of the actual). As one can see, with lower 
confidence in the prior, even on smooth terrain, 
the results are very good for bin sizes of about 2 
cm (ranging from 5% error on very smooth or 
rough terrain to under 1% error on moderate 
terrain). This, again, is to be expected, since 
even the smooth terrain has large excursions 
compared to the sensor noise. If we reduce the 
prior variance further, however, the estimator 
performance degrades rapidly. This presumably 


occurs because much more error exists in the 
terrain histogram reconstruction than would be 
apparent from the sensor noise alone. Thus, if 
the simulation is not "driven" strongly by the 
Bayesian prior, it is "free" to choose any match 
to the sensor data, weighted artificially heavily 
due to the low sensor noise. Thus, even though 
the sensors are good, the terrain estimates 
which result from our sparse sampling and crude 
interpolation are not nearly so good. Thus 
weighting the perceived errors from this 
function by one over the sensor variance is 
unrealistic; we compensate by making the 
Bayesian prior very tight. Thus there is no 
particular value to be gained in evaluating 
somewhat different levels of sensor noise. 


Percent 

Odometry 

Error 



Figure 3. Odometry error as a function of 
terrain roughness (20% actual slip) 


Numerous additional runs analogous to Table 1 
were performed using different simulated 
terrain (using different seeds for the random 
number generator), and the results were 
virtually identical. Note that there are 
occasional anomalies where the performance is 
poor (such as in Table 1 at roughness 0.04 and 
bin size 1.75 cm). These anomalies presumably 
result when the terrain and binning processes 
conspire to give ambiguous terrain for matching 
purposes. This is to be expected, but so long as 
it is rare and does not give worse estimates 
than doing nothing (i.e. using just the prior mean 
estimate), then no harm is done. This is another 
reason to overweight the prior distribution. 
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Additional runs explored several issues. For 
example, when the prior distribution 
underestimates the forward advance, the 
performance is generally good for bin sizes 
between 2 and 3 cm, but that very bad 
performance is not uncommon. Another issue 
considered was the estimation performance 
when the actual forward advance is not 
Gaussian. Once again, the performance was 
excellent. Lastly, we considered the system 
performance when the actual slip is a function 
of sinkage and slope, as one would expect. The 
results were evaluated for the case when the 
mean of the actual advance per cycle drops 
linearly with increasing sinkage and/or slope, 
(and continuing with the non-Gaussian uniform 
actual distribution). Since the very rough 

terrain will probably have slopes and sinkages 
which would literally stop the vehicle under 
such an assumption, we clipped the left end of 
the uniform distribution at zero advance per 
cycle, so that the simulation doesn't get in an 
infinite loop (as would the actual vehicle). Here 
we have assumed that the linear coefficients 
are such that a 60% grade will stop the vehicle, 
as would sinkage of 5 cm. The performance on 
smooth terrain was poor, as the Bayesian prior 
of 6 cm/cycle was much larger than the actual 
average, which is 5 at best and 1 at worst, 
depending on terrain conditions. However, as 
soon as the roughness increases to 1 cm or so 
over the length of the vehicle, the odometry 
performance improves to within 10% and at 2-4 
cm roughness. Only a few percent of odometry 
error is observed for bin sizes between 2 and 3 
cm. This performance is very encouraging 
considering the simplicity of the model and the 
gross deviations which "reality" makes with the 
assumptions underlying the model. 

Computati onal Requirements 

As described above, the optimal bin size is in 
the neigborhood of 2.5 cm, which means that 
there are only 20 bins of data over the length of 
the vehicle to be accumulated and maintained, so 
the compuation and storage requirements are 
small. Good performance can be anticipated 
with only 2 terms in the WSS function- one for 
the Bayesian prior (which can be precomputed 
and stored in a table) and one for the center 
wheel, since the look-ahead sensor is so noisy 
as not to affect the forward-advance estimate. 
(The front wheel moves onto unknown terrain, 
and so is not used in the matching process.) 


Since squaring can also be accomplished as a 
table look-up, the computation is of the order of 
1 add and 2 table look-ups per bin, with 
typically 5 bins searched. Finding the global and 
secondary minima requires roughly 2 
comparisons per bin. Maintenance of the 
histogram requires a relatively few operations 
also, since the histogram data can be in a ring 
buffer with a pointer, to avoid actually shifting 
the data in an array. Thus only the linear 
interpolation and "split the difference" 
operations are needed, which are simple. This 
implies that, with of the order of 100 
operations per cycle, the odometry estimates of 
the vehicle can be markedly improved, and 
sinkage estimates provided. 

Preliminary Test Results 

The algorithm described above has been 
implemented on Rocky 3.2 and, as of this 
writing, a few test runs have been conducted. 
The preliminary indication is that the 
articulation sensor noise is substantially larger 
than anticipated, leading to odometry results 
which are somewhat degraded compared to the 
simulations. However, it appears that, even 
with the noisy data, the algorithm will give a 
very reasonable hazard alarm for sinkage and 
slippage. (In this case, we set the confidence in 
the Bayesian prior to be very high, and then 
threshold the WSS function to trigger a slip 
alarm.) Work is continuing on reducing the noise 
in the analog-to-digital portion of the system. 
Extensive tests for this system are planned for 
1994. 

Conclusions 

The MAP estimation procedure developed here, 
based on a simple weighted-sum-of-squares 
computation, seems to give sinkage and slippage 
estimates which will allow planetary 
microrovers to detect and avoid a wide range of 
non-geometric hazards. Simulations suggest 
that it may also improve odometry markedly 
over simple wheel revolution counting, and 
thereby lead to a significant improvement in 
dead-reckoning accuracy for this class of 
vehicle. 
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Abstract 

In the design and construction of mobile ro- 
bots vision has always been one of the most potentially 
useful sensory systems. In practice however, it has also 
become the most difficult to successfully implement. 
At the MIT Mobile Robotics (Mobot) Lab we have de- 
signed a small, light, cheap, and low power Mobot Vi- 
sion System that can be used to guide a mobile robot in 
a constrained environment The target environment is 
the surface of Mars, although we believe the system 
should be applicable to other conditions as well. It is 
our belief that the constraints of the Martian environ- 
ment will allow the implementation of a system that 
provides vision based guidance to a small mobile rover. 

The purpose of this vision system is to process 
realtime visual input and provide as output information 
about the relative location of safe and unsafe areas for 
the robot to go. It might additionally provide some 
tracking of a small number of interesting features, for 
example the lander or large rocks (for scientific sam- 
pling). The system we have built was designed to be 
self contained. It has its own camera and on board pro- 
cessing unit. It draws a small amount of power and ex- 
changes a very small amount of information with the 
host robot. The project has two parts, first the construc- 
tion of a hardware platform, and second the implemen- 
tation of a successful vision algorithm. 

For the first part of the project, which is com- 
plete, we have built a small self contained vision sys- 
tem. It employs a cheap but fast general purpose mi- 
crocontroller (a 68332) connected to a Charge Coupled 
Device (CCD). The CCD provides the CPU with a 
continuous series of medium resolution gray-scale im- 


ages (64 by 48 pixels with 256 gray levels at 10-15 
frames a second). In order to accommodate our goals 
of low power, light weight, and small size we are by- 
passing the traditional NTSC video and using a purely 
digital solution. As the frames are captured any desired 
algorithm can then be implemented on the microcon- 
troUer to extract the desired information from the imag- 
es and communicate it to the host robot. Additionally, 
conventional optics are typically oversized for this ap- 
plicaticMi so we have been experimenting with aspheric 
lenses, pinholes lenses, and lens sets. 

As to the second half of the project, it is our 
hypothesis that a simple vision algorithm does not re- 
quire huge amounts of computation and that goals such 
as constructing a complete three dimensional map of 
the environment are difficult, wasteful, and possibly un- 
reachable. We believe that the nature of the environ- 
ment can provide enough constraints to allow us to ex- 
tract the desired information with a minimum of com- 
putation. It is also our belief that biological systems re- 
flect an advanced form of this. They also employ con- 
stant factors in the environment to extract what infor- 
mation is relevant to the wganism. 

We believe that it is possible to construct a 
useful real world outdotv vision system with a small 
computational engine. This will be made feasible by an 
understanding of what information it is desirable to ex- 
tract from the environment for a given task, and of an 
analysis of the constraints imposed by the environment 
In order to verify this hypothesis and to facilitate vision 
experiments we have build a small wheeled robot 
named Gopher, equipped with one of our vision sys- 
tems. 


Copyright 1993 by Andrew Gavin. Published by the American Institute of Aeronautics and Astronautics, Inc. with permission. 
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1. Philosophy 

In the design and construction of mo- 
bile robots vision has always been one of the 
most potentially useful sensory systems. 
However, it has also become in practice the 
most difficult to implement successfully. 
Here at the Mobot Lab we have designed a 
small, light, cheap, and low power vision sys- 
tem that can be used to guide a mobile robot 
in a constrained environment. At this point 
we are using as our target environment the 
surface of Mars. It is our belief that the con- 
straints of this environment will allow the im- 
plementation of a system that provides vision 
based guidance to a small mobile Martian 
rover. 

For many animals vision is a very im- 
portant sensory system. In primates (and 
therefore humans) vision processing is the pri- 
mary sensory mode and occupies a large por- 
tion of the neo-cortex. While it is clear that a 
variety of senses are essential to a mobile enti- 
ty, be it robot or animal, vision has a number 
of substantial advantages. It is able to provide 
a survey of a fairly broad section of the world 
(approximately 2CiO degrees in humans) and at 
some considerable distance. The regular inter- 
play of light and surfaces in the world allow 
internal concepts such as color and texture to 
be extrapolated, making object recognition 
possible. Additionally, vision is a passive sys- 
tem, meaning that it does not require the emis- 
sion of signis. No other mode of sensation 
has all these properties. Chemo-receptors 
(smell and taste) are inherently vague in direc- 
tionality. Somatic (touch) input is by defini- 
tion contacting the organism, and therefore 
provides no input from distant objects. The 
auditory system is probably the second most 
powerful mode, but in order to provide infor- 
mation about inanimate (and therefore silent) 
objects it must be an active emitter, like bat 
sonar. However, it is only underground and 
^p in the ocean that the environment is un- 
interesting in the visual range of the spectrum. 

Despite this, a useful artificial vision 
system has turned out to be very difficult to 
implement. Perhaps this is because the com- 
plexity and the usefulness of the system are 
linked. Perhaps it is also because no other sys- 
tem must deal with such a large amount of 
input data. Vision receives what is essentially 


a three dimensional matrix as input, two spa- 
tial dimensions and one temporal dimension. 
This input’s relationship to the world is that of 
a distorted projection, and it moves around 
rapidly and unpredictably in response to a 
laTjge number of degrees of freedom. Eye po- 
sition, head position, body position, move- 
ment of object in the environment just to name 
a few. The job of the vision system is to take 
this huge amount of input and construct some 
small meaningful extraction from it. Never- 
theless, whatever the reason for the difficulty 
in implementation, it is clear from ourselves 
and other animals that vision is a useful and 
viable sense. 

For this project we had as our goal the 
construction of a small vision system that 
takes as a visual input the view from atop a 
small Martian rover. Its job would be to then 
quickly process this input in realtime and pro- 
vide as output a small bandwidth of informa- 
tion reporting on the relative location of safe 
and unsafe areas for the robot to go. It might 
additionally provide some tracking of a small 
number of features “interesting” to the rover, 
for example the lander. The vision system 
was designed to be self contained. It has a pan 
Md tilt camera head and an on board process- 
ing unit. It also draws a small amount power 
and exchanges a very limited degree of infor- 
mation with the host robot. The project has 
two parts, first the construction of a hardware 
platform, and second the implementation of a 
successful vision algorithm. 

Professor Rodney A. Brooks, the head 
of the Mobot lab, has long been the champion 
of the belief that small cheap systems with a 
biologically based “behavioral” design can 
provide excellent results in real mobile robot 
applications [l]. He has demonstrated this 
with many small robots that have provided ro- 
bust powerful performance with very small 
amounts of processing power. It is fairly 
widely believed that the Mobot lab’s robots 
are some of the most successful fully autono- 
mous robots built to date. They include nota- 
bles such as Squirt [2], the tiny fully autono- 
inous robot, and Ghengis and Atilla, a pair of 
highly robust small legged robots. Professor 
Brooks also believes that small cheap robots 
should be used in space exploration [3]. 

As to the second half of the project, it 
is our hypothesis that a simple vision algo- 
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rithm does not require huge amounts of com- 
putation. That goals such as constructing a 
complete three (Umensional map of the envi- 
ronment are difficult, wasteful, and possibly 
unreachable. We believe that the nature of the 
environment can provide enough constraints 
to allow us to extract the desir^ information 
with a minimum of computation. It is also our 
belief that biological systems reflect an ad- 
vanced form of this. They employ constant 
factors in the environment to extract what in- 
formation is relevant to the organism. 

This theory has already been used in 
our lab to implement a mobile robot that is 
“among the simplest, most effective, and best 
tested systems for vision-based navigation to 
date” [4; 5; 6]. We believe that these ideas can 
be combined with what is known about the 
Martian surface to create a system able to pro- 
vide useful information to a Martian rover. 
The few existing Martian surface pictures re- 
turned from the Viking landers show a flat 
landscape of fine dust with protruding rocks. 

We have implemented several differ- 
ent algorithms on our system. Several of these 
attempt to extract the same navigational infor- 
mation from the scene. Each of these tech- 
niques is sensitive to different features, and so 
the simultaneous use of multiple approaches 
can yield added reliability. This ^gorithms 
are (hscussed below in detail. 

We believe that it is possible to con- 
struct a useful real world outdoor vision sys- 
tem with a small computational engine, lliis 
will be made feasible by an understanding of 
what information it is desirable to extract from 
the environment for a given task, and of an 
analysis of the constraints imposed by the en- 
vironment. 


2. The Mobot Vision System:an 
ACTIVELY CONTROLLED CCD CAMERA 
AND VISUAL PROCESSlNCw BOARD 

We had as our design goals the cre- 
ation of a compact vision system which was 
cheap, simple, and had a low power consump- 
tion. We wanted to have everything needed 
for simple vision built in, including a signifi- 
cant amount of processing power. We chose 
the Motorola 68332 as the brain. This is an 
integrated microcontroller with on board 


RAM, serial hardware, time processing unit 
(TPU), and a peripheral interface unit. It has a 
^cent amount of horsepower, essentially that 
of a 16-25 MHz 68020, and has a software 
controllable clock speed for power regulation. 
The 68332 was available in a simple one 
board solution from Vesta Technologies. To 
this we added 1 Megabyte each of RAM and 
of EPROM. 

In accordance with om philosophy we 
decided on an image size of 64 by 48 with 256 
levels of gray. These images are large enough 
that most important details of the view are 
clearly visible and recognizable to humans 
(see adjacent 
example image). 

Any number of 
gray shades in 
excess of about 
64 are nearly in- 
distinguishable, 
but 256 is con- 
venient and al- 
lows for a neat single byte per pixel represen- 
tation. Our chosen resolution requires an 
image size of 3K bytes. We also chose a 
frame rate of 10-30 frames per second. We 
believe that both the chosen frame rate and 
resolution are more than sufficient to allow the 
simple visual tasks which we intend this sys- 
tem to accomplish. They also make for a total 
bandwidth of 30-90K bytes/second, which is 
well within the amount of data that the 68332 
can transfer while leaving lots of free process- 
ing time. It takes approximately 7 millisec- 
onds for the processor to clock and read in a 
single frame. At 10 frames per second this 
only consumes 7% of a 16MHz 68332’s pro- 
cessor time. This allows 93% for processing 
the images, which amounts to about 186,000 
instructions per image, or about 62 instruc- 
tions per pixel. 62 instructions per pixel, 
while it might not seem like much, or more 
than sufficient to do many of the simple vision 
tasks we have in mind. If more calculation is 
required, then a faster version of the chip can 
be used to double the available power. 

To the CPU Board we added two 
piggy back boards: a Camera Board, and a De- 
bugging Board. The Debugging Board has 2 
hex LEDs for output, 6 bit DIP switch for 
input, a reset and debug button, and some 
video circuitry. These circuits are extremely 
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simple because the 68332’ s interface module 
allows nearly any kind of device to be mapped 
onto the bus. Even the video circuitry is very 
simple. We use two chips, a video level Digi- 


sumption reasons. Additionally the system is 
completely capable of running without the De- 
bugging Board attached, so that when the de- 
velopment phase is complete, it can be re- 


Debugging board 



tal to Analog Converter (Intech VDAC1800) 
and a video sync generator. The timing sig- 
nals are run into the extra two input bits in our 
8 bit input port and a trivial program on the 
CPU watches for transitions and outputs 
image data to the D to A convertor at the ap- 
propriate times. This allows us to display im- 
ages contained in the CPU at 30 hertz to a 
video monitor with just two chips. Most of 
the work is done by the CPU. The system is 
capable of capturing images from the camera 
and outputting them to the video display at 30 
hertz. Since video output consumes consider- 
able CPU time it is not advisable to run the 
display at the same time as doing extensive vi- 
sion calculations. However it provides a nice, 
easy output of the camera image in realtime 
which is essential for debugging, and for tun- 
ing the camera circuitry. There are also two 
switches on the Debugging Board to toggle 
the LED and video hardware for power con- 


moved. 

The Camera Board is the heart of the 
system. It contains a high speed Analog to 
Digital Converter (Sony C^®1175), some 
timing circuitry, the CCD level converters, 
and some output ports for our eye positioning 
servo actuators. It was one of the design con- 
siderations of this project that we should not 
use analog video at any point in our capture 
process (the video output on the Debugging 
Board is the only video circuitry in the entire 
system). At the resolution and frame rate we 
desire video adds unnecessary complications. 
It forces the use of high resolution CCD’s, and 
a 30 hertz frame rate. Besides, since we desire 
to capture the frames digitally on the CPU 
Board there is no need to go through this con- 
voluted intermediate stage. Instead we chose 
a low resolution CCD from Texas Instruments 
(TC211), clock the chip directly from the pro- 
cessor, amplify the signal, and read it through 
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the A to D converter straight back into the 
processor. A simple pro^am on the processor 
generates the needed timings, and since the A 
to D converter is connected as a memory 
mapped device, it simply reads in the pixel 
data. Since the CCD is a 192 by 165 device 
the input program merely clocks over two out 
of three pixels and lines in order to subsample 
the image at 64 by 48. A slightly modified 
version of the software is capable of grabbing 
192 by 144 at the same frame rate, but results 
in a consequent reduction of the number of in- 
structions available to each pixel (about 7 at 
10 hertz). 

The generation of the timing turned 
out to be quite simple [ 7 ]. The Integration sig- 
nal (INT) used to signal the exposure timing 
was simply connected to a CPU parallel pin, 
as was the Image Area Gate (lAG) signal used 
to clock lines downward on the CCD. A third 
signal, the Anti Blooming Gate (ABG) was 
generated automatically by the 68332’s TPU 
at no cost in CPU time. The only difficult sig- 
nal was the Serial Register Gate (SRG) signal 
which shifts pixels outs of the current row. 
This signal must be as fast as possible and 
timed precisely to the A to D converter’s sam- 
pling clock in order to get the peak of each 
pixel’s signal. Fortunately, since the 68332 
automatically puts out a chip select signal to 
the A to D converter to signal its possession of 
the bus, we used this as the SRG. By running 
this chip select signal through an adjustable 
delay and then into the convertor’s sampling 
clock we were able to match the time it takes 
the CCD and amplifier to actually output the 
pixel. The Camera Board has several adjust- 
able potentiometers, an adjustable delay knob, 
a signal offset knob, and a signal gain knob. 
All must be adjusted together in order to 
achieve a good picture. 

Also on the Camera Board is the level 
shifter circuitry used to drive the CCD chip. 
This was specially designed with both sim- 
plicity and low power consumption in mind. 
The CCD chip requires its clock signals to be 
at specific andog voltages and so we explored 
three methods of converting the processor’s 
TTL level signals. The first method was to 
employ the driver chips sold by the CCD man- 
ufacturer for this purpose. We rejected this 
because of the high power consumption which 
seems to be unavoidable in high speed clock 


generation circuitry. The second method was 
to use an operational amplifier to add analog 
voltages. Because we wanted a low power 
circuit, and also wanted to reduce the number 
of components, we chose the third solution, 
which was to use analog switches that could 
toggle the voltage at a reasonably high fre- 
quency and which were fast enough for the 
processor’s clock rate. Our circuit resulted in 
a total power consumption for the Camera 
Board of less than half a watt (when it is sup- 
plied +5V, +12V, and -12V). 

From the Camera Board a six wire 
cable connects to the camera head. Since the 
robot needed to insure as wide an angle as 
possible, we explored small short-focal-length 
lenses. Generally wide angle lenses have sev- 
eral merits, such as a wider depth of focus, 
which makes a focusing mechanism unneces- 
sary, a smaller F number, which increases the 
image brightness, and an insensitively to cam- 
era head vibration. However, it is sometimes 
difficult for wide angle lenses to compensate 
for the image aberrations. After testing sever- 
al aspheric lenses, pinhole lenses, and CCD 
lens sets, we decided to use a f=3mm CCD 
lens from Chinon (0.6” in diameter and 0.65” 
long, 5g weight). 

In front of the lens we placed ND fil- 
ters in order to prevent over saturation of the 
CCD. Our CCD is actually quite sensitive and 
needs at least a 10% pass filter to operate in 
room level lighting, sometimes it even needs 
considerably more. In order to expand the dy- 
namic range of the camera the frame grabbing 
software is designed to calculate the light level 
of the image and adjust the integration (expo- 
sure) time of the frame correspondingly. TTiis 
adds an extra 10 decibels of dynamic range to 
the system, allowing it to work adequately in 
subjective indoor light levels ranging from 
lights off at night to sunlight streaming in 
through several large windows. 

The camera is mounted on top of two 
Futuba model airplane servo actuators (FP- 
S5102). These are very small and light 
weight, and allow easy and fairly precise con- 
trol of the camera position by the CPU. The 
servo actuators are connected via the Camera 
Board to the 68332’s TPU. This allows the 
generation of their Pulse Width Modulated 
(PWM) control signals with virtually no CPU 
overhead. These actuators give the camera 
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both pan and tilt over a fairly wide range (170 
degree pan, % degree tilt). The CPU has a 
numjjer of simple routines that allow it to 
specify both absolute and relative positioning 
of the actuators, to read where they are, and 
preforms bounds checking to prevent the actu- 
ators from twisting the camera into them- 
selves. The camera head and its servo actua- 
tors weigh 68 grams. 

We have gone to a great deal of effort 
to minimize the power consumption of the Vi- 
sion System and have been quite successful. 
The CPU board consumes 0.5 watts of power. 
The Camera Board also uses 0.5 watts. This 
means that the entire CPU and vision system 
consumes less than 1 watt of power. The 
video out circuitry on the Debugging Board 
requires an additional 1 watt of power, howev- 
er there is a switch to disable this circuit, and 
since its use is for debugging this is not signif- 
icant. The servo actuators also require some 
power. When idle they consume a 
mere 0.05 watts. Unless they are 
being moved constantly their average 
power draw is barely more than this. 

Cost was also a significant 
factor in our design. The Vesta CPU 
Board costs $300. One megabyte of 
static RAM costs $2(X) (one could 
use less to save money, a megabyte 
of EPROM costs $25 (again one 
could use less to save money), two 
servos cost about $130, the CCD 
costs $35, the driver chips $30, the 
analog to digital convertor $25, the 
video chips $50, the power convertor 
$65, and miscellaneous other chips 
about $10. This is a total cost of 
around $700, many significant com- 
ponents of which could be eliminat- 
ed to save njoney. One could use 
less RAM, or forego the servos or 
Debugging Board, possibly bringing 
a useful system as low as $350 in 
parts. 

Overall this system is a small, 
cheap, and low power vision solu- 
tion. It provides the input of 64 by 
48 pixel 256 level gray scale frames 
at 10-30 hertz from a small camera 
with CPU controlled pan, tilt, and 
dynamic range, as well as about 62 
680x0 instructions per pixel of pro- 


cessing time at 10 frames per second. All of 
the electronic circuits fit in a space 15 cubic 
inches big, consume less than a watt of 
power, and cost about $700 dollars to build. 
The available processing time is sufficient to 
do simple calculations such as blurs, edge de- 
tections, subtractions, the optical flow normal, 
thresholding and simple segmentation. A 
number of these can be combined to provide 
useful realtime vision based information to a 
robot. 


3. Gopher: a prototype vision based 

ROBOT 

In order to fully test our system in a 
real environment we have been building a 
small vision based robot. Gopher (see dia- 
gram). This robot is based on a R2E robot 
from ISR. The robot is quite small (15 inches 
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tall, 10 inches wide, and 12 inches long). It is 
wheeled, has a gripper on the front, and a 
number of sensors, including shaft encoders, 
touch sensors, and infrared sensors. These 
motors and sensors are controlled by a number 
of networked 6811 processors. A master 
68332 processor, mns the behavior language 
which was designed here at the MIT Molwt 
lab. We have added a flux gate compass accu- 
rate to 1 degree and two of our 68332 based 
vision systems. The boards are all contained 
within the R2 case, which has been extended 
vertically for this purpose. We have mounted 
one of our CCD camera heads on its dual 


When all of the Vision Board variable 
knobs (adjustable delay, signal offset knob, 
and signal gain knob) are tuned properly the 
system captures an image with a full range of 
gray scales, which means a smooth clear 
image to the human eye. These parameters, 
while interrelated, can be tuned to a specific 
instance of the system in a few minutes. 
There is then little need to deal with them 
again unless a major system component (such 
as the amplifier, analog switch, or CCD) is ex- 
changed. 

The system is however fairly sensitive 
to light level. The CCD is very sensitive and 



servo base on top of the robot, adding an extra 
3 inches to the robot’s height. 

The Gopher robot provides a small and 
relatively simple low cost robot with an inde- 
pendently controlled camera and vision sub- 
system. We have used this system to imple- 
ment many simple visual bas^ tasks in a co- 
ordinated and integrated fashion. 


4. IMAGES! HOW GOOD IS 64 BY 48 

We have equipped the Vision System 
with a relatively wide angle lens (approxi- 
mately 60 to 70 degrees). This is most useful 
for robot applications because the relative 
characteristics of space and objects around the 
robot are of more concern than the specific de- 
tails at any one point. 

Human perception of these images is 
quite good (see example images below). Ob- 
jects approximately a meter square are easily 
visible at 20 feet. When angled down toward 
the ground the camera gives a good view of 
the space into which a forward driving robot 
will soon be entering. 


requires at least a 10% pass filter to operate at 
normal light levels. In bright sunlight we usu- 
ally add an additional 33% pass filter. By au- 
tomatically adjusting the integration time in 
software we can cope with a moderate range 
of changing light levels, sufficient to encom- 
pass most operating conditions in an indoor 
environment. At some extremes of this range 
the image becomes more highly contrasting 
and less smooth. However this dynamic range 
is not sufficient to cover multiple environmen- 
tal extremes, for example outside under sun- 
light and nighttime. To cope with this addi- 
tional hardware would be required to increase 
the dynamic range. We have considered sev- 
eral other options. Filters could be changed as 
conditions vary, either manually, mechanical- 
ly, electronically, or possibly using a kind of 
self adjusting filter (as found in some sun- 
glasses). Additionally we are exploring the 
possibility of using a CCD with an electronic 
shutter. This would allow for a significant in- 
crease in the dynamic range, but would com- 
plicate the production of the CCD control sig- 
nals. 



Blob Tracking 



Black dot tracks the blob on Black dot marks the center The shirt can be 

this motion blurred image of the segmented shirt segmented by intensity 


5. SOFTWARE 

The Mobot Vision System is program- 
mable in either 68332 (680x0) assembly or in 
C using the Gnu C Compiler (GCC). We have 
written a variety of basic routines. These 
setup the system, grab frames, actively adjust 
the integration time based on image light lev- 
els, output video frames, and move the camera 
via two servo actuators. 

It takes approximately 7 milliseconds 
to grab a frame. This means that 10 frames 
per second occupies 7% of the CPU, leaving 
62 assembly instructions per pixel free at this 
frame rate. We have coded in assembly a 
number of basic vision primitives. For a 64 
by 48 8 bit image they 
have the following ap- 
proximate costs: Blur 

(11 instructions per 
pixel), Center/Surround 
(11 instructions per 
pixel), Sobel edge detec- 
tion (6 instruction per 
pixel), image difference 
G instruction per pixel). 

Threshold and find cen- 
troid (worst case 6 in- 
structions per pixel). 

As can be seen 
from the above figures a 
number of these basic op- 
erators can easily be 
combined to make a fair- 
ly sophisticated realtime 
(10 fps) visual algorithm. 

By maJdng assumptions 
about the environment it 


is possible to construct algorithms that do use- 
ful work based on vision. For example, as a 
simple test case we have implemented code 
that thresholds an image and finds the center 
of any bright “blobs” in the image (see exam- 
ple images). This code requires at worst case 
6 instructions per pixel plus some trivial con- 
stant overhead. We then use this information 
to center the camera on the “brightness center” 
of the image. The result is that the camera 
will actively track a person in a white shirt, or 
holding a flashlight. It is able to do so at 15 
hertz while outputting video. This might not 
seem very useful, but by changing the thresh- 
olding conditions the camera would be able to 
track anything that can be segmented with a 



Mars from the Viking Lander 


692 





small amount of processing. Intensity bands, 
the optical flow normal, thresholded edges, 
(and with filters) colored or infrared objects 
are all easy candidates for this technique. 

We have used the Mobot Vision Sys- 
tem to implement a host of other visual based 
behaviors. Here at MIT Ian Horswill has built 
Polly, a completely visually guided robot that 
uses an identical 64 by 48 gray scale image 
and a processor only somewhat more powerful 
than the 68332. Polly is designed to give 
“brain damaged tours of the lab.” It is capable 
of following corridors, avoiding obstacles, 
recognizing places, and detecting the presence 
of people [5]. We have brought many of these 
skills to the Gopher system, and to Frankie, 
another lab robot based on the Mobot Vision 
System. The algorithms for avoiding obsta- 
cles and following corridors are easily within 
the power of one of these vision vision sys- 
tems. Ian Horswill has also used one of these 
vision systems, slightly modified to add an ad- 
ditional camera, to do binocular gaze fixation. 


6. Mqbots in Space; Appl ications to 
Mars 


NASA is planning on sending a small 
autonomous rover to Mars in the next few 
years as part of the Messur/Pathfinder mis- 
sion. It is our belief that this rover could ben- 
efit from some vision based guidance, and that 
the approach used in the Mobot Vision System 
would be very suitable. A remote rover sent 
to Mars will be very limited by weight, size, 
and power consumption. The current rover 
design uses a digital CCD camera system 
quite similar to ours, as well as a low power 
processor. For power reasons the rover oper- 
ates at very slow speeds, and so the algorithms 
we discussed here could run at a reasonable 
rate on its small processor. 

The surface of mars as captured by the 
Viking Landers is fairly flat (at least where 
they landed) and regular (see Viking image 
above). It consists of a surface of tiny dust 
particles littered with an fairly Gaussian distri- 
bution of rocks. A rover’s physical character- 
istics will determine what size of rocks are 
hazards and which are not. It would be useful 
for a vision system to be able to look forward 
and estimate roughly how much space in each 
direction there is until rocks that are too large 
to cross. 

The first of our algorithms is texture 
based. This algorithm depends on a number 

of assumptions. 
These assump- 
tions make it pos- 
sible to extract 
useful data from 
an image in a rea- 
sonable time, 
however, it is im- 
portant to be 
aware of the limi- 
tations they im- 
pose. If we as- 
sume that the 
ground is roughly 
fiat, as indeed it 
is in the Viking 
images, then 
rocks that are far- 
ther away will be 
higher up in the 
image. Addition- 
ally the low reso- 
lution of the cam- 
era is convenient 


A simple experiment employing a real Martian Image 



Image 1 
(orginal) 



Image 3 
(find edges) 



Image 2 
(sharpen) 


Image 4 
(threshold) 
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Close up of rock segmentation 



High detail original, large rocks circled Segmentation of large rock clusters 


because it will filter the dust and all small 
rocks into a uniform lack of texture. Texture, 
and therefore edges, is an indicator of objects 
or rocks. If the rocks can be segmented, then 
by starting at the bottom of the image and 
looking upward for “rocks” we can create a 
monotonic depth map of the distance to rocks 
in various dilutions. This is a simplified ap- 
proach because it assumes that the ground is 
pretty flat. However it works surprising well 
on the Viking test image. Observe the series 
of images above. Number one is a 64 by 48 
image as seen from the Viking Lander. Ignore 
the lander itself in the image, and assume a 
rover based camera can be mounted in the 
front where the rover itself will not be visible. 
In image 2 we have sharpened the edges of the 
images, and in image 3 we have used a simple 
edge detector. Finally image 4 was made by 
using an intensity threshold. Notice that the 
larger rocks in the original image are visibly 
segmented in the final image. By adjusting 
the threshold it is possible to segment for 
rocks of various sizes. This method is based 
on one devised by Horswill [4] and runs in re- 


altime on the Mobot Vision System with pro- 
cessing to spare. We have used this system in 
several robots to avoid obstacles on flat uni- 
form floors (usually carpets or cement). It 
works quite robustly. 

The second algorithm is based on mo- 
tion. We have implemented an algorithm that 
calculates the magnitude of the optical flow in 
the direction of the intensity gradient in real- 
time. If we make the assumption that the 
robot is moving forward roughly in the direc- 
tion of the camera than the rate of motion of 
an obstacle is proportional to 1/d where d is 
the distance of the obstacle from the camera. 
This means that there will be very little move- 
ment in the center of the direction of travel, 
and more on the edges. Objects will acceler- 
ate rapidly as they approach the camera [8]. 
This large movement can be seen as increased 
flow, and with thresholding nearby obstacles 
can be loosely isolated. An even simpler strat- 
egy is to turn the robot in such a manner as to 
balance the flow on either side the robot. If 
there is more flow on the left go right, and 
vice versa. A large amount of flow in the 


Optical Flow Examples 



Light dots indicate flow. Arrow is Image of an office with a person Flow of center image, arrow 

preferred direction walking by points away from flow 


694 



lower area of the image indicates an rapidly 
looming object. This technique is very similar 
to that employed by a number of flying in- 
sects. Balancing flow works well in a moving 
agent to avoid static objects. Some examples 
of this in action can be seen here. The line 
with the square on the end is an arrow indicat- 
ing the direction the robot should go, the cross 
indicates an estimate of the direction of mo- 
tion. 

Because our algorithms run in real 
time, it is not necessary to convert to a com- 
plex three dimensional map of the world. We 
can convert straight from an image based map 
to a simple robot relative map. This is actual- 
ly much more useful to a robot, and is vastly 
less computationally intensive. The realtime 
nature of our calculations applies a temporal 
smoothing to any errors made by the algo- 
rithm. This type of vision calculation tends to 
be very noisy, but with temporal smoothing, 
this noise is greatly reduced without have to 
resort to very difficult and time consuming 
calculations. 


7. Conclusion 

The images sampled by visual sensors, 
be they organic eyes, or inorganic cameras, 
contain a great deal of useful information 
about the environment, often available at more 
distance than other sensors allow. We believe 
that the limited amount of processing avail- 
able on our Mobot Vision System, when com- 
bined with a low resolution image and reason- 
able assumptions about the environment will 
be sufficient to provide a robot with a good 
deal of useful information. We also beUeve 
that a small, light, cheap vision system such as 
this could be applied to many branches of the 
robotics field, including space exploration, in- 
door, and outdoor robots. 
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Al ys tr a ct 

The paper discusses the sensory, control, and 
operation systems of the "MITy" Mars micro-rover. Its 
compact and low-power sensor suite, with customized 
sun-tracker and laser rangefinder, provides dead reckoning 
and hazard detection in unstructured environments 
without aid from external sources. A high-level task 
architecture supports mapping, obstacle avoidance, 
GN&C, mission monitoring, and ground telemetry with 
high processing efficiency. For cluttered environments, 
reactive obstacle avoidance chooses the clearest 
trajectories with non-holonomic steering constraints and 
passing margin tradeoffs. Wireless operator interactions 
range from troubleshooting and reprogramming to 
graphical monitoring and supervisory control of the 
robot. The micro-rover system has been simulated in 
Monte-Carlo trials and field tested in various 
environments. Continuing work focuses on space 
qualification of the sensors and control software and 
further implementation of the ground station. 

L Packground 

1 . 1 The MITy Mission 

MIT and Draper Lab-sponsored development of a 
low-cost Mars micro-rover began in 1990 and has since 
involved seventeen graduate and undergraduate students. 
This project supports the NASA MESUR objective of 
landing a micro-rover on Mars to scout and perform 
experiments on the environment. The "MITy” project 
goal has been to develop prototypes for this mission, 
which imposes strict constraints on size and mass, aid 
from external sources, and modes of operator interaction. 
The proposed operation scenario was to receive a set of 
destination commands from Earth daily, arrive at each to 
perform and record an experiment, then transmit the 
results back to Earth on the next cycle. Destinations 
would be chosen from rover video images and satellite 
maps. The current design scenario also allows 
supervisory monitoring and control for terrestrial and 
inner-space applications, in which communications are 
less limited. 
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Present contributions of the MITy project include 
studies on system requirements, design for mobility, 
soil-wheel interactions in low gravity, and most recently 
the sensory, control, and operation systems in this 
report [1,2, 3,4,5]. Current team developments include 
system qualification procedures with JPL, vision for 
navigation and hazard detection, refinements of mobility 
and packaging, 3D simulation and animation, and 
robotic manipulator construction. 

1.2 System Components 

The second of three prototype platforms, MITy-2, 
is depicted in Figure 1, Its sensor subsystem is 
described in detail in Section 2. 



Figure 1; MITy System 


Rover Breakdown 

The micro-rover structure consists of three 15x15 
cm articulated platforms, connected by a dual-spring 
suspension, that support sensing, processing, and 
payload subsystems. Six independently driven wheels 
enable the rover to climb steps up to 15 cm high and 
drive at speeds up to 30 cm/s on flat ground. The front 
and rear wheels are independently steered to permit a 63 
cm minimum turn radius. Power is provided by a 30 
amp-hr capacity battery, which can be recharged by a 
solar panel at a maximum rate of 6 W. The overall 
dimensions of the robot are 46 x 75 x 28 cm, and its 
total weight is about 9 kg. 




The processing subsystem consists of a Z-180 
based micro-controller, with a throughput of about 12 
Kflops/s and 512 Kb for code and data storage. 
Customized drivers generate motor and communications 
signals for a high level control interface. Software 
development supports and remote debugging. 

The current payload on the rover is a CCD camera, 
which sends images over a video transmitter to the 
operator station. Data communication occurs over a 
9600 baud radiomodem with a LOS range of 2 miles. 
An optional LCD/keypad, not displayed, provides much 
of the operator station functionality for testing 
purposes. 

Operator Station Breakdown 

The main component of the operator station is a 
486/66 PC-clone laptop, which runs the graphical user 
interface and programming environment. Operator 
devices are the computer display, TV/VCR, keyboard 
and trackball; communication devices are the 
radiomodem and video receiver. The small TV/VCR is 
useful for real-time supervisory control. These 
components are powered a 12 V car battery, which 
makes the entire operation station portable for testing 
and relocation. 

1.3 Related Systems 

Other notable Mars micro-rovers in development 
include JPL's Rocky, RATLER from Sandia, and 
Genghis from ISX Robotics, which differ largely in 
terms of system capabilities and realization. For 
instance, MITy is geared toward the extended MESUR 
mission beyond 10 m of the lander, while Rocky has 
been designed for operation within 10 m [6]. Larger 
rovers include CMU's Ambler and the Mars Rover from 
JPL. 

2. Sensory System 

2.1 Sensory Objectives 

The sensor suite on the rover is required to provide 
sufficient navigation and hazard information to the 
control system for autonomous transit [1]. In 
particular, the subsystem requirements state that the 
rover position and attitude {x,y,z,0,(p,^} from the 
landing location should be determinable to within 10% 
of traveled distance. The hazard requirement states that 
the rover should be able to detect and localize potential 
hazards between the range of 1-10 ft, and within 1 ft be 
able to sense all types of accountable hazards. At the 
micro-rover scale these hazards include rocks, craters, 
and steep grades. Sensor selection, development, and 
placement are features of this problem. 


Constraints that apply to the sensory system are to 
minimize power consumption, size and mass, and 
prototype cost. The sensory system should not rely on 
external sources and should be operable in the negligible 
atmosphere and magnetic field of space. Redundant 
sensors are also desirable in case of failures. 

2.2 Navigation Sensors 

By keeping within the 10% navigational error 
bound, the video image of the landscape can be 
compared with that from the previous day, reducing 
cumulative position error over multiple days. The video 
image plays an important role in navigation, since 
selection of the daily goals as well as position 
calibrations are based entirely on this data, as analyzed 
by the ground station [7]. The result is that the 
performance requirements of the rover navigation 
sensors are greatly relaxed, reducing cost, power, size, 
and complexity. 

Dead reckoning is used to navigate to the stated 
accuracy, since this does not require an existing 
infrastructure such as land beacons or GPS satellites. 
Rather than using a costly inertial navigation package, 
the dead reckoning sensors are divided into three types: 
longitudinal translation, heading angle, and inclination. 

Translation 

Longitudinal translation is measured with the 
powered drive wheels of the rover, as well as an 
unpowered drag wheel. Drive wheel rotation is sensed 
with motor tachometers, which are supplied for speed 
control. An optical encoder is used to sense the drag 
wheel rotation. Both drag and drive wheel sensing is 
used to accommodate the potentially large variety of 
terrain. A drag wheel is beneficial when the powered 
wheels slip on soft surfaces, or when the vehicle wheels 
becomes partially unloaded due to rocks underneath its 
belly. However, since the drag wheel is smaller for 
packaging reasons it is less accurate than the drive 
wheels over rough terrains. 

Heading An gle 

Heading angle sensors include a sun sensor and an 
inertial angular rate sensor, hereafter called the gyro. 
The gyro is useful regardless of environment conditions, 
but at a cost of unbounded error growth with time. The 
sun sensor is used for dynamic calibration when shading 
is not an issue. 

The selected rate gyro is a low-cost silicon 
vibrating beam sensor made primarily for automotive 
applications. The rate signal is integrated with a low- 
leakage analog circuit to provide a voltage signal that is 


proportional to heading angle. Bias error in the gyro 
and integrating circuit is measured at stopping points for 
substraction in software. 

The sun sensor was designed and constructed at 
CSDL, due to its unique requirements of a 
hemispherical field of view, small size, non-scanning, 
and low cost. To achieve these characteristics, a two- 
axis position sensitive detector (PSD) measures the 
position of a spot of light focused from a small fisheye 
lens. The position of this light spot is a function of the 
sun elevation (e) and azimuth (P), as well as the 
inclination of the rover, as shown in Figure 2. Simple 
electronics are required to obtain light spot position 
from the PSD currents, making the entire sun sensor 
assembly a small rugged device that provides heading 
calibrations to within 0.5°. 



Figure 2: Sun Angle Sensor 

Inclination 

Two accelerometers provide tilt information for sun 
sensor calibration and propagation translation 
measurements out of the ground plane. These sensors 
are described in more detail below. 

2.3 Hazard Sensors 

One of the most difficult challenges of a micro- 
rover is sensing mobility hazards in an unstructured 
environment. The primary sensing problem is detecting 
objects for collision avoidance, although other mobility 
hazards exist. 

Adequate sensing for a mobile robot typically 
requires a detailed depth map of the local terrain in all 
directions. The combination of high resolution and 
large field of view results in a large amount of data that 
requires much processing. Even if such processing were 
available, collecting the data with a small sensor is 
difficult. The MITy design presents a simplified hazard- 


avoidance sensor arrangement that meets its packaging 
requirements, and provides limited but sufficient sensing 
capability for autonomy. It is composed of a single 
axis scanning laser range finder, short distance 
proximity sensors, bump switches, and inclinometers. 

Range Finder 

Like the sun sensor, the range finder was designed 
and developed at CSDL in order to meet the unique 
needs of the micro-rover. For size and simplicity 
reasons, it works on active triangulation principles, 
illustrated in Figure 3. A solid-state laser produces a 
collimated beam in the near infrared. Light is diffusely 
reflected from the target, and a portion of this weak 
signal is collected by a small lens and focused to a spot 
on a 1-axis position sensitive detector (PSD), similar to 
that used on the sun sensor. Since the receiving lens is 
located a small distance from the laser, the angle of 
incidence of reflected light varies with range. Through 
triangulation, range can be calculated based on the 
reflected light angle, which is measured in the form of 
light spot displacement at the PSD. 



Figure 3: Triangulation Rangefinding 

With simple electronics for transmitting and 
detecting, the complete range finder remained small, and 
yet ranges out to 3 m with 10% accuracy, which was 
specified from control system simulations [4]. The 
range finder is mounted to the front platform, and scans 
180° in front of the rover at a height of 15 cm on flat 
ground. This plane-of-view approach is limited but 
provides adequate collision avoidance capability when 
used with other sensors. 

Inclinometers 

Sensing the tilt of a given rover platform is 
important for both navigation and hazard-avoidance 
reasons. The latter requires inclinometer data to 
calculate the orientation of the range finder beam, and to 
estimate the terrain geometry. 

Accelerometers are used to sense the component of 
gravity when tilted, rather than the bulkier bubble level 
sensors. Miniature non-inertial grade accelerometers are 
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relatively small and faster, allowing pairs for pitch and 
roll on the front and middle platforms. 

Proximit y Sgn§pr$ 

Because of the planar range finding, vertical drops 
are sensed through two short-range proximity sensors. 
These are used on the front platform along with 
inclinometer data to estimate when the rover is partially 
over the edge of a cliff. The rover can then travel in 
reverse, using the high traction of the four rearward 
wheels. 

Collision Sensors 

Contact switches are located at the front and rear 
edge of the rover to detect collisions above a certain 
force. In addition, the odometry system can provide 
collision detection by observing front wheel speeds in 
response to commanded torques. 

2.4 Implementation 

The locations of individual sensors on the micro- 
rover are shown in Figure 1. All of these sensors are 
relatively low-performance, which allows them to be 
small, light, low-cost, and rugged. The total volume, 
mass, and power for the sensing system are about 2000 
cm^, 5 W, and 0.5 kg respectively, with a prototype 
cost of under $5 K. Rather than concentrating on fewer 
sensors that are high performance, this arrangement 
accomplishes similar tasks through sensor fusion in 
software, and additionally provides a certain degree of 
redundancy, increasing the system reliability. 

3. Control System 

3.1 Control Objectives 

The semi-autonomous control problem is transit 
between a sequence of coordinate locations, without 
position or environment information from a priori or 
external sources. In addition, with sufficient 
communication rates, supervisory control should permit 
real-time operator interaction with sub-tasks, such as 
guidance and mapping. The robot should recognize 
traps that confuse its control logic, and be able to be 
recovered by the operator under supervisory mode. 

Mobility, sensing, and processing constraints affect 
control system design. Mobility constraints include the 
minimum turn radius, overall dimensions, and climbing 
ability. Range, accuracy, and throughput of sensors 
must be recognized by the design, as well as processing 
throughput and memory. 


3.2 Functional Breakdown 

This control objective precipitates a variety of 
concurrent tasks, including mapping and obstacle 
avoidance, GN&C, mission monitoring, and ground 
telemetry. A functional breakdown of the autonomous 
transit mission is depicted in Figure 4. Boxed tasks 
perform an atomic function, which improves modularity 
and lessens code redundancy, utilizing other tasks and 
circled resources. Map and Avoidance tasks are related 
to trajectory generation. GN&C encompasses the 
Guidance, Nav, Speed, and Steer tasks. Lastly, 
Sequence, Collision, Failure, and User tasks comprise 
mission monitoring. The diagram also shows 
information transfer between tasks and resources. Task 
groups and their implementations are described below. 



3.3 Task Architecture 

The task architecture determines the scheduling of 
tasks and communications between them. The MITy 
architecture was mostly influenced by Payton's reflexive 
control ideas, which were implemented on DARPA’s 
ALV [8]. Tasks are divided functionally and by cycle 
rates. Ideally each task would run concurrently, but this 
requires customized low-level scheduling and 
interprocess communications on a single processor. 
Instead, tasks are broken down into fast executing steps, 
which are interleaved by the task planner according to 
their intervals and priorities. The three types of tasks 
are main, support, and background; the type determines 
how a task is handled by the planner. Main tasks 
compete for motion control, support tasks aid main 
tasks, and background tasks aid all tasks. Information is 
communicated between tasks through a global variable 
pool. 

All tasks are divided into perceptions and reactions. 
The planner decides which reactions to run based on the 




‘'truth” of their perceptions. Any condition may exist in 
a perception, but every task has a flag and interval, 
which are set by the user to determine if and how often a 
reaction should be called. Background tasks have only 
these two conditions; support tasks will run only if 
their associated main task is running. Only one main 
task may run at a given time, which determines its rank 
as it executes. Main task perceptions have exclusive 
and positive priorities which are compared to the current 
reaction rank when true. A higher priority will 
subsume a lower rank, else the current reaction will 
continue without interruption. When a main task 
reaction ends without interruption, its rank is reduced to 
zero to allow the next highest priority main task to run. 

This high level method is more efficient for the 
particular task designs than common real-time 
schedulers, which interleave tasks at the operating 
system level. The MITy task planner supports both 
pre-emption and concurrent execution at step 
resolutions, and does so without the overhead required 
for low-level context switching. Also, schedulers are 
not standardized and often unavailable for many 
processing platforms, which would lessen portability of 
the control code to future robots and simulations. 

lA Trajectory Generation 

Trajectory generation tasks produce heading and 
speed commands that maneuver the robot around 
obstacles toward the current target. 

Ma pp in g 

The mapping function is a support task for 
obstacle avoidance. It sweeps the laser scanner 180° in 
10° intervals, sampling the rangefinder at each stop. A 
reading is considered valid if its intensity is sufficient 
and the laser is not aimed at the ground. The last two 
sweeps of data are stored in a circular list, whose 
elements correspond to particular scanning directions. 
Elements consist of the coordinate endpoint of the last 
valid laser reading in that direction. This storage 
method constantly refreshes the obstacle list while 
maintaining complete coverage for asynchronous 
obstacle avoidance. 

Obstacle Avoidance 

The obstacle avoidance routine makes intelligent 
use of robot-centered laser information to maneuver 
through cluttered environments. It is the lowest priority 
main task-it is active in the absence of emergencies and 
consumes available processing time between other 
tasks. The general philosophy is borrowed from the 
VFH method [9], which is reactive in nature in that it 


does not plan ahead and must cycle quickly relative to 
robot motion. It has been shown more stable and 
predictable than methods based on artificial potential 
fields. 

In the MITy approach, represented in Figure 5, 
polar map creation and trajectory selection are quite 
different from VFH. The independent axis of the polar 
map represents trajectory headings tangent to the 
minimum turn radius of the robot, while the dependent 
axis shows the "free distance," which is initially 
calculated as the distance to the first collision in all 
directions. The robot model is a circle that includes 
both its physical dimensions and the obstacle 
uncertainty due to laser and scanner accuracy. Free 
distance is then linearly traded off with safety, target 
heading error, and momentum heading error to obtain 
the weighted polar map. Safety is defined as the 
minimum distance to an obstacle in passing, which 
tends to guide the robot through the centers of clearances 
rather than narrowly on a side. The best trajectory is 
chosen in the direction of the highest free distance after 
weighting. 

In constrast, the VFH method considers vehicle size 
at only one range, does not contend with steering 
constraints, and does not trade off safety and target error 
with free distance. It is therefore better suited to 
omnidirectional vehicles operating with a short range of 
concern. 

The routine uses prediction and error correction to 
keep the robot in motion between trajectory updates. 
The nominal radius of concern and trajectory speed are 
respectively 3 m and 8 cm/s. If the robot sees no way 
out of a situation by moving forward, it produces a dead- 
end signal for collision recovery. In proximity to the 
target, the search radius is limited to prevent avoidance 
of obstacles behind it. 

3.5 GN&C Module 

Guidance, navigation, and control tasks conceitedly 
command the drive* and steering motors to follow a 
given trajectory. 

G uidan<?e 

The guidance routine commands steering angles to 
meet a desired heading, using a proportional filter on 
heading error. It also stops the robot if it has not 
received a trajectory update after a fixed distance. 
Guidance supports both obstacle avoidance and collision 
recovery, depending on which produces trajectory 
commands. 
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Navigation 

Navigation integrates heading, rate, and inclination 
information to update {x,y,z,0,(p,^} of the rover. 
Redundant sensors are arbitrated rather than averaged; the 
primary sensor is used unless disqualified, either by its 
own or other sensor readings. The sun sensor 
outweighs the gyroscope with a non-zenith sun, and the 
drag wheel outweighs tachometers over smooth terrain. 

Control 

Drive and steering control loops are separate but 
cooperative background tasks. Speed control uses 
tachometer readings and an anti-windup integral filter to 
generate motor voltages. Voltage commands to 
individual wheels are offset in accordance with steering 
rates and angles, to help aid steering and minimize 
slippage. Prohibitive torques, calculated from applied 
voltage and measured speed, are reported as collisions. 
The servo loop fixes steering and scanning rates and 
bounds to prevent power surges and motor damage. 

3.6 Mission Monitoring 

Mission monitoring tasks react to hazard collisions, 
monitor targets for completion or failure, and report 
progress to the ground station. 


C oDisipn R e cove ry 

Rough terrain and reflective obstacles can cause the 
laser rangefinder to fail and may result in obstacle 
collisions. Collisions types are actual bumps, steep 
grades, deep craters, stuck wheels, and dead ends. When 
these occur, the collision task overrides obstacle 
avoidance to back the rover along its entry path, using a 
"look-ahead" path following method based on [10]. It 
then notes the collision location in the obstacle map for 
continued obstacle avoidance. If a collision is detected 
in reverse, the rover stops without updating the map and 
continues forward immediately. 

T a rg et Monito ring 

Two functions are performed by target monitoring: 
sequencing and failure detection. When the rover reaches 
a target, determined by an estimated passing distance 
less than the minimum turn radius, the sequencing task 
overrides both collision and obstacle avoidance to 
perform a desired experiment. It then advances the target 
counter until the last target is reached. A target is 
considered failed if the rover does not advance on it or 
escape a given radius after a given amount of travel, as 
prescribed by [11]. Failure results in mission stoppage 
to conserve energy until operator intervention. 

Telemetry 

The telemetry function can operate in either the 
background or foreground. In background mode, it 
periodically sends position, obstacle, and free distance 
updates to the operator station, while servicing non- 
destructive sensor sampling requests and modifications 
to targets and parameters. Alternatively, the user may 
override semi-autonomous mode to set trajectories 
directly in supervisory mode, in which case telemetry is 
the highest priority foreground task. 

3.7 Implementation 

The control code was developed in standard "C." It 
is easily ported between the robot and simulation 
(described in Section 5) by replacing stump I/O 
functions. 

Control parameters were initially determined from 
Monte-Carlo simulations and fine tuned on the actual 
robot. Obstacle avoidance throughput was traded off 
with background task cycle rates and obstacle mapping 
resolution and memory. As a result, most background 
tasks cycle at 10 Hz, while obstacle avoidance repeats at 
1 Hz. 


4> Operator Station 

4.1 Operation Objectives 

The capabilities of the operation station should 
include graphical monitoring and intervention for 
supervisory control, on-the-fly troubleshooting and 
reprogramming for semi-autonomous control, and data 
logging and replay for post-mission analysis of the 
rover. Communication bandwidth, time delay, 
interference, and range are restrictions on operation 
effectiveness. 

4.2 Real-Time Display 

A typical operator screen is shown in Figure 6, 
composed of a graphical window and text interaction 
areas. The text buttons emulate the optional keypad 
interface to the robot. 


Figure 6: Ground Station GUI 

A graphical window displays robot telemetry in 
real-time and allows user interaction with goal points. 
Robot position, obstacles, and free distances are updated 
independently at about 1 Hz onto a reference grid. These 
can be toggled for display as well as various driving aids 
for supervisory control. The scale and offset of the 
window relative to the robot can be adjusted on-the-fly. 
The screen follows robot motion by means of a travel 
boundary, which recenters the screen on the robot when 
crossed. 

The button interface provides all means of 
interaction with the robot, from parameter and goal 
changes to sensor sampling requests. It also backs up 
graphical interactions, such as goal positioning and 
window resizing, with precise entry ability. A message 
window displays robot responses to user interactions and 
shows the precise navigation state at all times. 


4.3 User Intervention 

The operator interacts with the robot in one of three 
operation modes: semi-autonomous, supervisory, ready. 
Human factor issues in monitoring and shared control 
are regarded, as introduced by Sheridan [12]. The robot 
is placed into ready mode on power-up. 

Re^y Mode 

In this mode the robot can service troubleshooting 
and reprogramming requests by the operator; all control 
parameters, goals, and sensors are accessible for viewing 
and modification. The drive system is inactive in this 
mode for safety reasons. Parameter sets may be saved 
and loaded from the operator station for testing or 
optimization purposes. The robot's initial position and 
path are set in this mode based on static video imagery; 
the mouse can set target destinations graphically. A 
panic button returns the robot to ready mode from other 
active modes. 

Supervisory Mode 

From supervisory mode the operator can command 
speed and steering to the robot. Arrow keys drive the 
robot forward or backward from 0-30 cm/s and can steer 
down to 63 cm arcs. Graphical aids for supervisory 
control include superimposed turning arcs, lines between 
targets, and unerased telemetry data. This mode is used 
for fine maneuvering or trap extraction, often relying on 
real-time video imagery in addition to position and 
obstacle telemetry. Although obstacle avoidance is 
inactive on the robot, it will still stop for collisions in 
case of operator error. To push an object or climb out 
of craters, collision parameters can first be modified in 
ready mode. 

Semi-Autonomous Mode 

In its baseline mode, the micro-rover autonomously 
travels between target destinations, which are initialized 
in ready mode. The operator may guide the robot in 
real-time by moving the current target with arrow keys, 
which move it radially and axially about the robot. The 
rover performs homing and obstacle avoidance at its 
nominal speed, freeing the operator to interact only 
when required. Real-time video can provide long-range 
information to the operator for maneuvering the rover 
around hazardous regions, rather than individual hazards. 

4.4 Analysis Tools 

Post-mission analysis involves logging and 
replaying telemetry data at a desired rate and display 
perspective. Telemetry signals may be recorded to a file 
from any mode during the mission. At operator station 
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startup, either real-time or replay operation may be 
chosen. The latter allows the user to load a telemetry 
file and analyze it with the same graphical aids available 
to the real-time mode, even if those aids were not active 
during recording. 

4,5 Im plempntatiQn 

The operator station code is currently written in 
BASIC. A transmission delay of 0.2 sec was found by 
comparing the speed of implementing a simple 
command from the remote operator station to that from 
the optional keypad on the robot. This delay consists of 
both protocol overhead and radiomodem throughput; it 
is minimized by customized asynchronous and buffered 
message passing. 

5. Performance Results 

The MITy system has been implemented and tested 
in both the field and simulation. The simulation is used 
for development and Monte-Carlo performance 
evaluations. 

5.1 Simulation 

The rover simulation embodies the control code and 
two dimensional models of the platform and 
environment. It runs on an IBM/320 workstation at 
about 100 times actual rover speed. 

Modeling 

The vehicle, sensors, and environment models are 
low-fidelity, two dimensional representations intended 
for control system development. The vehicle is modeled 
kinematically as a bicycle with first-order steering 
dynamics. Perfect traction and rigid body assumptions 
are made at nominal speed, which has been shown 
reasonable for the micro-rover in low gravity packed 
powder environments [2]. The laser scanner and 
inclinometer models also have first-order dynamics; the 
drive motors, platform suspension, and other sensors are 
quasi-static at the time scale of interest. Rover 
dimensions and locations of sensors are represented 
faithfully. The laser is modeled as a ray and the 
bumpers as line segments in an environment of circular 
obstacles. All time constants and kinematic parameters 
were determined from experimental data, and sensor 
returns are considered ideal. 

Monte-Carlo Statistics 

Performance statistics were compiled on batch runs 
of the rover through random obstacle fields. In the 
nominal run, the rover must travel to a target 46 m (50 
yd) away through 25 cm obstacles with 6% aerial 


density, the median distribution of rocks on Mars [13]. 
Runs were arbitrarily considered failures if the rover 
traveled over 92 m or crossed outside the 31 x 61 m 
field boundaries. 

Measures of roVer behavior and performance were 
selected by inspection and correlation studies, which 
describe time and power usage, path features, overall 
safety and navigational error, and failure modes of each 
run [4]. For a batches of 100 runs, statistics on these 
measures were compiled over variations in obstacle 
fields and control parameters; a few of these are listed in 
Table 1 for various obstacle sizes and densities. The 
mean passing distance is the average distance to the 
closest obstacle, which reflects rover safety. Normal 
deviation is of the distance away from the straight line 
path to the target, to estimate the area necessary to 
penetrate a given obstacle field. The total path distance 
shows power usage, while the distance in reverse 
indicates collision frequency. These measures were only 
compiled for successful runs, which is the foremost 
indicator of performance. 


Table 1: Statistical Measures 


performance 

measures 

obstacle radius (cm)/ aerial density (%) | 

25/6 

25/10 

25/3 

30/6 

1 20/6 

mean pass (m) 

0.8- 

0.7 

1.2 

0.9 

0.7 

norm dev (m) 

1.4 

2.5 

4.3 

1.2 

2.1 

total dist (m) 

59 

94 

50 

54 

77 

rev dist (m) 

2.5 

14.2 

0.4 

1.3 

8.4 

success (#) 

100 

44 

99 

99 

94 


Case Studies 

Individual case studies highlight special abilities of 
the MITy control system. A rover that can recover from 
dead-ends without operator intervention, as shown in 
Figure 7(a), is desirable for cluttered obstacle fields with 
limited sensor ranges. In 7(b), the rover escapes the 
type of large concave obstacle that often troubles 
potential field methods of obstacle avoidance. Lastly, 
methods that ignore turn radius constraints would suffer 
in the hole-in-the-wall test, which demonstrates the 
unification of target homing with obstacle avoidance in 
7(c). 

5 ,2 Field Te$t$ 

The rover and portable operator station were 
transported to various locales to test real system 
performance. The results presented here are qualitative 
and more telling of hardware performance than the 
simulation. 




(a) dead end (b) concave obstacle (c) hole-in-wall 

Figure 7: Simulated Case Studies 


Effects of Environment 

Navigation and hazard detection were affected by 
different environments as listed in Table 2. Only 
problems are noted; otherwise, the rover performed as 
expected. Specular reflection and misorientation of the 
laser increased the frequency of collisions. Navigation 
was generally over 95% accurate outdoors, even over 
sand and under shade, due to dynamic sensor arbitration. 


Table 2: Environment Effects 


Environment 

Nav. problems 

Hazard problems 

Hallways 

Pavement w/ 
traffic cones 

Sandy beach 
w/ sand piles 

Rocky 

beach 

Grass field 
w/ people 

gyro drift 

sun reflectance off 
building windows 

wheel slippage on 
sand 

wheels not in 
contact w/ ground 

no problems 

specular reflection 
at low incidences 

specular reflection 
on edges of cones 

specular reflection 
off polished sand 

pitching/rolling of 
laser scanner 

no problems 


Supervision Effectiveness 

Overall system performance and robustness were 
increased by using the various rover modes in 
conjunction with each other, which eases user fatigue 
during semi-autonomous segments while allowing 
detailed supervisory operations. In fact, much of the 
control system was debugged using the operator station 
for the insight and flexibility it provides. Real-time 
video images were mainly useful for target homing 
rather than obstacle avoidance, because the spatial 
relationship between the rover and observed obstacles 
was not intuitive to the user. 

6. Continuing Work 

Future plans in sensing are to incorporate a quartz 
gyro and phase-locking to a modulated laser for more 
accuracy and less power. The operator station is 
currently being integrated with customized 3D 
simulation and animation packages for operator training 
and further system verification. Further team effort will 
hopefully culminate in space qualification of the final 


MITy prototype for consideration in the NASA 
MESUR mission. 
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Abstract 

Recent studies of the types, numbers, and 
roles of robotic systems for use in human space 
exploration, including the First Lunar Outpost 
(FLO) mission, with a focus on planet surface 
systems are summarized in this paper. These high 
level systems engineering modeling and analysis 
activities have supported trade studies and devel- 
opment of preliminary requirements for intelligent 
systems, including supervised autonomous robotic 
systems. The analyses are summarized, results are 
presented, and conclusions and recommendations 
are made. 

One conclusion is that space exploration will 
be "enabled" by the use of supervised intelligent 
systems on the planet surfaces. These intelligent 
systems include capabilities for control and 
monitoring of all elements, including supervised 
autonomous robotic systems. With the proper 
level of intelligent systems, the number and skills 
of humans on the planet surface will be deter- 
mined predominantly by surface science and 
technology (not outpost) objectives and 
requirements. 

Space robotics, especially those systems being 
developed to operate on planetary surfaces, can be 
considered a form of the emerging technology of 
field robotics on Earth. The solutions to the 
problems we will be solving to make the explora- 
tion of our solar system possible and practical will 
apply to the many critical problems we have that 
require operating in hazardous environments on 
Earth and to improving human productivity in 
many fields. 


* Chief Scientist, Automation and Robotics 
Division, Member AIAA 
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1. Introduction 

Human space exploration is a strategy for 
stimulating the United States, its people, and its 
economy as much as it is a strategy for exploring 
the Moon and Mars. A White House report 1 has 
outlined various visions and architectures for this 
crucial effort. We take the position in this paper 
that the greatest benefit to the U.S. economy of 
any space-exploration-related technology can 
come from the development of supervised intelli- 
gent systems, including supervised autonomous 
robotic systems. Such systems are mandatory for 
space exploration 2 to improve safety, reliability, 
and productivity, while enabling large cost savings 
through minimizing logistics 3. Such systems are 
also needed in the U.S. economy A 5. 

Intelligence is the ability to acquire and apply 
knowledge and skills to achieve stated goals in the 
face of variations, difficulties, and complexities 
imposed by a dynamic environment having signifi- 
cant unpredictability. Intelligent systems are 
composed of sensors for sensing the "world," 
effectors for acting on the world, and computer 
hardware and hardware and software systems for 
connecting the sensors and effectors in which a 
part of the processing is symbolic (nonnumeric). 
This processing enables practical reasoning and 
behavior, which in humans we call intelligence. 

Examples of artificial intelligence capabilities 
in intelligent systems are knowledge based 
systems, expert systems, natural language 
understanding systems, robotic visual perception 
systems, intelligent control and planning systems, 
qualitative and model-based reasoning systems, 
and supervised autonomous robots. Many supply 
an explanation facility that enables the user to ask 
what reasoning was used and why the conclusions 
were reached. Intelligent systems can be of four 
basic kinds: nonmobile, nonmanipulative systems 
such as monitoring and control systems; nonmo- 
bile, manipulative systems such as robot arms fixed 
in place at the shoulder; mobile, nonmanipulative 
systems such as inspection robots; and mobile, 
manipulative systems such as mobile robots with 
arms and end-effectors. While supercomputers, 
distributed computers, or parallel computers are 
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currently required to achieve real-time perfor- 
mance with large scale intelligent systems, CPU 
speeds double every 6 months, so such intelligent 
systems will be easier and cheaper to achieve in the 
future. 

It is important to understand the advantages 
intelligent systems have over conventional auto- 
mation. Some advantages are given by Erickson 6, 
which are primarily perception and flexibility in 
dealing with uncertainty and dynamics imposed by 
real environments. 

The benefits of using intelligent systems in 
space missions are improved and increased safety, 
reliability, and productivity. These benefits are 
derived from applying more knowledge and 
reasoning in more flexible and appropriate ways 
than conventional automation. 

The EVA helper/retriever effort 7 is an initial 
attempt to build and understand a limited version 
of a supervised autonomous robot for use in space. 
Many other efforts to build intelligent robotic 
systems, not necessarily for space, are under way 8. 


2. Space Exploration Studies 

Recent studies^ of the types, numbers, and 
roles of robotic systems for use in the 20-year 
Option 5A space exploration mission 10, with a 
focus on planet surface systems, are summarized in 
this section. These studies employed high level 
systems engineering models that we developed. 

We now employ a software modeling tool, the 
mission simulation and analysis tool (MSAT) 1 1, 
which enables us to account for the nonlinear 
effects of resource allocation, parallel support and 
mission tasks, and occurrence of contingencies. 

Mission feasibility is a paramount issue in 
requirements generation (along with verification, 
validation, and traceability). A useful device that 
exercises the skill and judgment of those 
concerned with requirements is to tell the story of 
the mission. 

These stories form the basis of input to MSAT. 
Any mission story will be in the form of a process 
description. At the requirements stage, the story 
of any subprocess (such as landing on the surface, 
unloading, etc.) will be in terms of objects specified 
by functionality, not by actual design. As the stage 
progresses toward design, the stories will involve 
process designs and objects wherein performance 
and operational parameters can be quantified. 


With the process description format, each 
mission story is told in terms of parallel processes, 
each with prescribed start times. Each process has 
a functionality type; at present the types used in 
MSAT are the following: 

• Mission backbone (e g., landing, launch, site 
preparation) 

• Science 

- EVA: geologic traverses, astrophysics, 
geophysics 

- IVA: labexperiments, life sciences, analysis, 
packaging 

• Maintenance 

- Dusting 

- Servicing 

- Repair (EVA, IVA) 

- Replacement 

- Testing 

- Inspection 

• Logistics 

• Support 

- Power 

- Thermal control 

- Communications 

- Crew safety and well-being 

Each process is broken down into subpro- 
cesses, called stages, and each stage has a set of 
options corresponding to the different ways in 
which the stage can be carried out. Each stage 
option has a model assigned that enables compu- 
tation of elapsed time versus stage option name 
and the types of agent resources to be used : 

• EVA, IVA, and equipment 

• EVA, robotics, and equipment 

• IVA, robotics, and equipment 

• EVA, IVA, robotics, and equipment 

• Robotics only and equipment 

MSAT is written in (interpreted) C, which is an 
application running under the Ellery Open System 
(EOS) 12. EOS is a development and run-time envi- 
ronment for distributed computing applications. 
MSAT is a relational, table- and model-driven simu- 
lator that makes allowances for parallel processes 
and dependencies, for supply and demand of 
resources to accomplish processes, and for elapsed 
time in accomplishing mission processes and tasks. 

In constructing the Option 5A models, we first 
reviewed the story of the mission from previous 
accounts that tells what is intended to be done 
during the mission with flight times, site layouts. 
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element and system descriptions, and manifests. 
Then we examined various advanced automation 
and robotics issues raised by the story. After estab- 
lishing two differing points of view, a conventional 
systems view and an intelligent systems view 
(causing changes in the equipment manifests and 
in the way mission tasks are carried out), we rede- 
scribe the missions from those points of view (see 
Table 1) and construct two models corresponding 
to these views for the purpose of obtai ning 
comparative results. 

Numerical models were constructed only for 
control and monitoring, unloading, site survey and 
regolith handling, emplacement, servicing, and 
maintenance. 

Figure 1 shows results from modeling crew 
workload demand for selected tasks versus crew 
EVA availability under the conventional systems 
model for four astronauts in 21 years of lunar 
missions. As can be seen, either more capable 
equipment, as in the intelligent systems model, or 
more crewmembers are required. This mission 
scenario, which calls for complex activities such as 
offloading of large equipment and construction of 
facilities in the absence of humans at the planet 
surface site, clearly makes intelligent robots 
mandatory. Figure 2 shows the crew EVA demand 
under the intelligent systems model and shows 


primary crew time for creative activities of 
exploration, science, and planetary resources use. 
Science and engineering skills in the crew may now 
replace some pilots and technicians. A supervised 
autonomous outpost is thus seen as mandatory to 
preserve small crew sizes and ambitious surface 
mission objectives. 

A broad range of robotic system uses in Earth 
orbit or during space transport is indicated by 
current studies. These include assembly of very 
large spacecraft systems such as propulsion systems 
and aerobraking structures'll. Maintenance of 
onboard equipment in Earth orbit or during space 
transport is another robotic system use being 
studied. 


3. First Lunar Outpost Studies 

This section is based on Erickson which has 
more details. The JSC Automation and Robotics 
Division (A&RD) has been performing high level 
systems engineering modeling and analysis 
activities to support trade studies and systems 
effectiveness analyses for proposed missions to the 
Moon and Mars. Preliminary requirements for 
intelligent systems, including supervised intelligent 
robots, have been the focus of our efforts. 


Table 1 - Conventional systems versus intelligent systems 


• Use Fisher-Price'^ recommendations 

• Conventional software 

- DDBMS for knowledge representation 

- Normal sensors 


• Mainly surface teleoperation, limited 
telerobotics 

• Rudimentary, mainly Earth-based DOKSS 

• Ground-based control and monitoring (for 
Moon) 

• More-than-minimal computing power 

• Predetermined procedures 

• Limited surface diagnosis and repair 

• Limited surface communication, major 
downlink 

• Crew used for outpost operations and 
maintenance, science and technology 
deployment 


• Use Fisher-Price recommendations 

• Intelligent system software 

- State-of-affairs knowledge representation 

- Extensive sensors/perception for knowledge 
acquisition 

- Ability to use knowledge 

• Supervised, autonomous robotics with structured 
environments 

• Distributed DOKSS, real time where needed 

• Surface-based, built-in control and monitoring 
with ground-based oversight 

• Major computing power and information storage 
on surface 

• Adaptable procedures with built-in precautions 

- Rehearsals 

- Interelement and interface testing 

• Design for ease of testing, diagnosis, servicing, 
maintenance, and repair 

• Major surface communication, major downlink 

• Crew used for science and technology, minimal 
outpost operations and maintenance 


707 



G toul hours for maintcnsnce. unhiding, servicing, emplacement, survey, and ~ EV A time available for Option 5 A crewsire of 4 
regnlith handling if performed by EVA 



2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 

YEAR 

Figure 1 - EVA allocation for conventional systems approach. 


Simulation is concerned with identifying and 
solving problems by testing how well the opera- 
tion of engineered designs will meet the mission 
objectives. Simulation of operations can provide 
early identification of performance problems of 
integrated design and operations concepts. When 
applied with alternative process and equipment 

□ EVA time for science 


designs, simulation of operations is used to obtain 
a less costly short cycle run-break-fix approach 
that can be iterated until simulations do not 
"break" anymore. Specialty engineering analyses, 
particularly reliability and maintainability, are 
most effective when implemented early in the 
design process when they can have the greatest 
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Figure 2 - EVA allocation for intelligent systems approach. 
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impact on overall design decisions. The JSC A&RD 
has developed MSAT for use in evaluating the fea- 
sibility and effectiveness of proposed mission 
concepts. We have also used a reliability and main- 
tainability assessment tool (RMAT) developed by 
the JSC Reliability and Maintainability 
Division 17, i8 forSSF applications to estimate the 
amount of maintenance for the lunar surface 
habitation element. 

The FLO mission, while being significantly 
more complex than any single Apollo flight, is 
vastly less complex than the Option 5A mission 
analyzed previously. Although there are periods 
when humans are not present, no offloading of 
equipment or construction of facilities is planned 
when crew is not present. Maintenance of facili- 
ties will still be required. We have continued to 
perform mission simulation and analysis to support 
system effectiveness studies of FLO and to under- 
stand the requirements for intelligent systems 
automation and robotics. 

The FLO mission is envisioned as the first way- 
point in expanding human presence in our solar 
system. FLO is established by the successful landing 
of an unoccupied human habitation element on 
the lunar surface and a subsequent 42-day visit by 
a crew of four that is transported to the lunar 
surface in a separate crew lander. The crew will 
arrive from 2 to 6 months after the cargo vehicle 
with the habitation element has landed. The 
habitat will be activated and checked out remotely 
before crew departure from Earth and will be in a 
ready state for crew arrival . Revisits to the outpost 
are projected at intervals of about 6 months. 
Humans are not present at the outpost during this 
interval; however, the outpost must be maintained 
sufficiently to allow reoccupancy. 

During the 42-day (lunar day, night, day) FLO 
first mission, the crew will 

• Perform equipment checkout and maintenance. 

• Unload and transfer equipment and supplies 
between the crew lander and the habitat. 

• Conduct local exploration and sample 
collection. 

• Deploy scientific instrumentation (e.g., for space 
physics and astronomy). 

• Deploy in situ resource utilization (ISRU) 
demonstration equipment. 

• Conduct engineering and operations tests (e.g., 
human and equipment tests under varying and 
extreme thermal and illumination conditions). 


• Perform life science experiments and IVA 
laboratory analyses. 

• Perform crew self-sustenance and operational 
activities (e.g., housekeeping, training, 
planning, eating, resting, public affairs 
communications). 

The habitation element provides all the 
facilities and subsystems (e.g., environmental 
control and life support, temperature and humid- 
ity control, data management) required to sustain 
the crew, except for food, personal items, and 
logistics resupplies that are transferred from the 
crew lander. The habitation element concept is an 
adaptation of the 5SF habitation module with 
deployable solar panels, thermal radiator, and 
high-gain antenna. An airlock is provided for crew 
ingress and egress with provision for lunar dust 
abatement. Regenerative fuel cells provide power 
during the long lunar night. 

3.1 Maintenance Simulation and Analysis 

Maintenance has been investigated as a 
critical issue of the FLO mission. As a critical issue, 
maintenance or lack thereof impacts the 
following: 

• Safety and survivability 

• Mission goals 

• Levels of performance 

• Logistics and spares (and related mass and 
volume) 

• Redundancies (and related mass and volume) 

• Levels of commonality 

• Designs for maintenance and repair 

• Designs for diagnosis 

• Control, monitoring, and fault diagnosis 

• Tools and equipment 

• Sensing and sensors 

• Crew availability 

• Amount and types of robotics 

• Cost 

The requirement addressed in analysis to date 
is to estimate the number of maintenance actions 
to be required as a function of time in the mission 
and the crew time required to accomplish the re- 
quired maintenance. This will allow us to address 
the maintenance impact on the mission story as 
implemented in MSAT and those results. In 
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addition, it will give early insight into the alter- 
native options and impacts to be considered. 

A simulation tool for estimating maintenance 
demand has been developed for the SSF program 
and has been used for the FLO analysis reported 
here. This simulation tool is RMAT developed by 
the JSC Reliability and Maintainability Division and 
Loral Space Information Systems. The following 
discussion of this tool is paraphrased from 
Blumentritti7 and the Assembly and Maintenance 
Implementation Definition Document 1 8. 

RMAT is a stochastic, event-oriented simula- 
tion process written in Fortran and implemented 
on a personal computer. System maintenance is 
simulated at the individual component replaceable 
unit level of detail. Input to RMAT is a data base, 
which for each replaceable unit contains reliability 
data of the mean time between failure (MTBF), 
equipment reliability class (i.e., electronic, elec- 
trical, electromechanical, mechanical, structural, 
and structural-mechanical), and the life limits. 
Maintainability data includes the replaceable unit 
location (internal or external), mean time to 
replace (MTTR), mean time between preventive 
maintenance (MTBPM), and the number of crew- 
members required for the maintenance. Robotic 
requirements can also be defined. Operations data 
in the data base includes the manifest and 
activation stages and the equipment duty cycles. 

Factors that contri bute to the generation of 
maintenance actions are the following: 

• Random failures based on a lognormal 
distribution of the MTBF 

• Early failures that are time-varying multipliers of 
the random failure rates and are based on a 
history of experience of spaceflights and 
satellites 

• Preventive maintenance actions that are 
scheduled actions 

• Life limit failures that are beyond the length of 
time of the current FLO study reported here 

A Monte Carlo simulation approach is used to 
estimate failures. The duty cycle is a part of this 
calculation, as is a cold failure rate to account for 
failure rate contributions when equipment is not 
operating. K-f actors 1 3 are applied as a failure rate 
multiplier to account for maintenance actions that 
occur for reasons other than the inherent compo- 
nent failure rate. For the FLO study, we used the 
default values that were developed by the SSF In- 
Flight Maintenance Working Group 1 8. 


Maintenance time consists of work site time 
plus overhead time. Work site time is the time 
required to remove and replace the line replace- 
able unit (LRU) at the work site. Overhead time 
includes the time to get the replacement part and 
tools, travel to the work site, set up, close out the 
work site, and return parts and tools. A lognormal 
distribution is used to simulate the variability in 
the work site and overhead times. To estimate the 
amount of crew time required, maintenance 
actions are packaged into EVA and IVA crew shifts. 
SSF definitions were used: one IVA shift is com- 
posed of two crewmembers for 8 hr, each one 
performing 4 hr of maintenance; one EVA is 
composed of two crewmembers for 6 hr with 1 hr 
of sortie overhead. 

In order to utilize RMAT to predict mainte- 
nance demands for FLO, a suitable reliability and 
maintainability data base was required. Since FLO 
was at the conceptual design stage, a representa- 
tive data base was sufficient. The similarity of the 
FLO habitat elements and subsystems to the SSF 
habitation module and distributed subsystems 
suggested that SSF component reliability data can 
be used as a reasonable approximation for FLO 
habitat component reliability data. The SSF pro- 
gram developed a reliability and maintainability 
data base of predicted values for its own mainte- 
nance analyses 18. We utilized that data (circa 
1991) to build the FLO data base where elements 
were in "common" between FLO and SSF. 

The mean work site time (MTTR) was 
estimated for each LRU and recorded in the data 
base. We used SSF times from the SSF data base, 
and where items were added we made separate 
estimates by comparison to SSF estimated times. 

Overhead times can be input at the ti me of 
execution of RMAT. We used 0.5 hr for IVA over- 
head times. For EVA overhead times, we chose to 
perform a parametric analysis and used overhead 
timesof 0.5 brand 1.0 hr for each LRU. We view a 
mean overhead of 1 .0 hr as an optimistic goal for 
FLO EVA maintenance actions. 


3.1.1 Simulations and Results 

Our approach to maintenance analysis is to 
perform parametric simulations that will provide 
answers (or insight into the answers) to key 
questions such as the following: 

• What is the level of maintenance actions 
indicated? 
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• What is the crew demand time (work site plus 
overhead) to perform these maintenance 
actions? 

• How many EVA and IVA shifts are required, and 
do they fit within the preliminary allocation? 

• How does the number of maintenance actions 
vary when crew is present and is not present? 

• What level of maintenance action backlogs 
exists? 

• What is the effect of delays in crew arrival from 
2 through 6 months after the habitat has 
landed? 

• What is the maintenance load for follow-on 
outpost visits? 

• What is the impact of backlogged maintenance 
on habitat functionality? 

To answer these and other questions, we 
formulated two basic maintenance scenarios: 

(1) instantaneous replacement, which gives an 
estimate of the maintenance load (a maximum) 
required to maintain the habitat in a full-up 
operational capacity and (2) scheduled resources 
where maintenance is delayed until crew arrival, 
which gives backlog estimates and functionality 
impacts. Both scenarios assume 1 00 percent diag- 
nosis of failures and no cascading failure effects. 
For each of the two scenarios, we simulate 2-, 4-, 


and 6-month delays of crew arrival. We estimate 
EVA, IVA, and total maintenance actions and use 
both 0.5 and 1 .0 hr. for EVA maintenance action 
overheads. For each scenario, we also simulate 
two follow-up missions of 45 days at 6-month 
intervals after each crew departure back to Earth. 
For each simulation, 50 to 1 00 runs (more than 
sufficient) are made by RMAT to calculate the 
results. 

We have also performed the simulations with 
and without the early failure model to establish 
the bounds on results. Although the early failure 
model is considered to overestimate the number of 
maintenance actions, it is considered the better 
estimator for planning purposes. 

Figure 3 shows the bounds on cumulative 
maintenance actions with instantaneous 
replacement for the two cases: (1)all maintenance 
action (MA) types and (2) all MA types excluding 
early failures. We also show these for the two duty 
cycles -crew present and crew not present 
(standby mode). The failure rate for the standby 
duty cycle is 20 percent less than that for the duty 
cycle when crew is present. For most of the 
scenarios, however, the FLO is in standby mode for 
a greater period of time than with crew present; 
therefore, the cumulative error rate will be closer 



711 


tothestandby duty cycle plots. Separate EVA and 
IVA results are estimated but are not shown here. 

Table 2 gives the mean number of mainte- 
nance actions for a variety of scenarios. The 
numbers are not cumulative. For example, the 
number of maintenance actions identified prior to 
the crew landing on the lunar surface is listed in 
the instantaneous replacement scenarios in the 
"visit 1 before" column. The number of new 
maintenance actions that arises while the crew is 
present is listed in the "visit 1 crew" column. The 
number of new maintenance actions that occurs 
after the visit 1 crew has departed the lunar sur- 
face until the time the second crew visits the lunar 
surface is listed in the “visit 2 before" column. For 
the scheduled resources scenarios, the "before" 
columns show the number of mai ntenance actions 
identified up to 2 weeks prior to the crew landing 
on the lunar surface. We speculate that the crew 
brings replacements for this set of maintenance 
actions. From these numbers, the backlogs can be 
calculated. 


Table 3 gives the EVA and IVA requirements for 
maintenance by crew based on maintenance 
actions identified prior to crew departures from 
Earth. This scenario corresponds to a logistics sup- 
port of carrying spares for failures diagnosed up to 
about 1 week prior to crew departure. Results are 
shown for 2-month and 6-month delays and for 
the first three visits to the outpost. Values given 
are for the maintenance actions identified before 
crew departure. The new maintenance actions 
that occur after crew departure from Earth 
through the time of return from the Moon are 
backlogged until the following crew visit. (In the 
2-month scenarios, the number of EVA and IVA 
shifts backlogged to visit 2 exceeds the EVA's and 
IVA's that are in the "visit 1 " column.) In the sce- 
narios that include all failure types, the number of 
required EVA's exceeds the EVA allotment through 
all three visits. IVA shifts required are within the 
allotment for visit 1 but would exceed the same 
allotment for visits 2 and 3. 


Table 2 - Number of maintenance actions. 


Model 

scenario 

Delay before 
first crew visit 

Visit 1 

Visit 2 

Visit 3 

Before 

Crew 

Before 

Crew 

Before 

Crew 

Instantaneous 
replacement, 
all MA types 

2 months 

42 

56 

127 

36 

99 

31 

3 months 

76 

39 

124 

34 

98 

36 

4 months 

96 

40 

118 

43 

89 

31 

6 months 

136 

52 

113 

26 

85 

33 

Scheduled resources, 
all MA types 

2 months 

31 

63 

93 

41 

87 

29 

6 months 

111 

46 

106 

32 

82 

20 

Scheduled resources, 
no early failures 

2 months 

12 

33 

41 

16 

61 

16 

6 months 

49 

18 

62 

15 

61 

17 


Before = Number prior to crew arrival, since last crew departure 
Crew = Additional number occurring during crew visit 


Table 3 - Number of EVA and IVA shifts required to perform maintenance actions 
identified prior to crew departure. 


Model 

Delay before 

Visit 1 

Visit 2 

Visit 3 

scenario 

first crew visit 

EVA 

IVA 

EVA 

IVA 

EVA 

IVA 

Scheduled resources. 

2 months 

5 

3 

12 

14 

10 

12 

all MA types 

6 months 

10 

10 

12 

14 

9 

11 

Scheduled resources. 

2 months 

3 

1 

6 

7 

6 

7 

no early failures 

6 months 

5 

5 

7 

7 

7 

8 


712 




We have made a prel i m i nary assessment of the 
functional impact each scenario has on the FLO 
habitation element. Although RMAThasthe 
capability, our representative data base does not 
include functional block diagrams or compre- 
hensive criticality identifiers. Redundant LRU's 
have shared duty cycles. The functional impact we 
have examined, as output by RMAT, is on the set of 
multiplexers/demultiplexers (MDM's). The MDM's 
are of particular significance because they provide 
the translation between the operative subsystems 
and the data management system for control and 
monitoring. Additionally, they are of sufficient 
number (31) to look at the results from a qualita- 
tive point of view. Scenarios with instantaneous 
replacement have no functional impact. Both the 
2-month and 6-month delay scenarios with sched- 
uled resources and all failure types have similar, 
and apparently significant, functional impacts. 
Approximately 15 percent of the time fewer than 
50 percent of the MDM's are operating; 70 percent 
of the time fewer than 75 percent of the MDM's 
are operating. For the 2-month and 6-month 
scenarios with only random failures and preventive 
maintenance, 25 to 30 percent of the time fewer 
than 75 percent of the MDM's are operating. 

The following are observations from the results of 
the simulations, including those discussed above. 
Unless otherwise specified, the observations are 
based on all MA types and for maintenance actions 
only by crew, with backlogs of maintenance 
actions not diagnosed prior to crew lander readi- 
ness (spares loaded 2 weeks before crew landing 
on the lunar surface). 

• The number of maintenance actions for visit 1 is 
sizable, regardless of scenario, and ranges from 
45 (2-month delay, scheduled resources, no early 
failures) to 188 (6-month delay, all MA types, 
instantaneous replacement). Furthermore, the 
number of maintenance actions for instantane- 
ous replacement and for scheduled replacement 
is in the same ballpark; i.e., 98 versus 94 for first 
visit, 2-month delay and 445 versus 397 for three 
visits, 6-month delay (see Table 2). 

• Except for the first visit of the 2-month delay 
scenario, the greatest demands for maintenance 
actions occur while crew is not present (see 
Table 2). 

• The crews will be faced with a sizable backlog of 
(prediagnosed?) maintenance actions upon 
arrival and will have to contend with significant 
additional maintenance actions that occur after 
their departures from Earth (see Table 2). 


• There are significantly fewer maintenance 
actions for the first visit if the time delay be- 
tween habitat landing and crew landing is 
reduced (e.g., 94 for 2-month delay versus 1 57 
for 6-month delay). But the number of these 
maintenance actions that occurs after the crew 
lander is ready for launch is greater for the re- 
duced delay; e.g., 63 for 2-month delay versus 
46 for 6-month delay (see Table 2). 

• For delays up through 6 months, the peak 
number of maintenance actions occurs on the 
second visit (see Table 2). 

• The number of IVA maintenance actions is 
greater than the EVA maintenance actions by a 
factor of 2 to 3 (interior LRU's outnumber exte- 
rior LRU's by approximately 7 to 1). However, 
the EVA total demand time (work site plus 
overhead plus sortie time) will be similar to the 
IVA total demand time for reasonable levels of 
overhead times (0.5 hr IVA, 1 .0 hr EVA). 

Demand time is not shown here. 

• The allocation of 10 shifts for IVA maintenance 
for the FLO first visit is sufficient to satisfy the 
demand, except for the maintenance actions 
arising after crew departure (6-month delay 
scenario). Additional allocation of IVA shifts will 
be required for visits 2 and 3 (see Table 3). (The 
allocation may be sufficient, depending upon 
further specifics of IVA definition.) All visits of 
scenarios without early failures fall within the 
allotment of 1 0 IVA shifts. 

• A FLO first visit allotment of four EVA's for 
maintenance will not be sufficient; 5 to 10 EVA's 
will be required plus whatever is required to 
contend with the mai ntenance actionsthatwill 
be backlogged. An even larger number of EVA's 
will be required on visits 2 and 3 because of the 
backlogged maintenance actions from previous 
visits (see Table 3). 


3.1.2 Implications of Results 

Implications derived from the simulations can 
provide early insight into the FLO mission design 
and the role of automation and robotics. 
Significant implications for FLO include the 
following: 

• Science, exploration, and technology objectives 
will be impacted unless maintenance demands 
on the crew are minimized. 
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• The indicated number of maintenance actions 
will have a significant impact on logistics 
resupply and resources (spare parts, EVA's, IVA 
shifts, robots, data system, etc.). 

• There will be significant impact to the function- 
ality of the outpost if timely repai rs are not 
made. 

• The number of new maintenance actions after 
crew departure from Earth indicates special 
attention to levels of redundancy and common- 
ality to the lowest level is indicated. 

• Timely, reliable diagnosis of failures will be 
critical and must be designed for. 

• The number of maintenance actions when the 
crew is not present must give rise to serious 
considerations for robotic repair capabilities. 

• Designforeaseof maintenance and repairwill 
be important in minimizing crew (or robotics) 
maintenance demands. 

• An onboard maintenance capability 
(workstation, tools, equipment, etc.) is 
indicated. 

Several factors (sizable number of mainte- 
nance actions, majority when crew is not present, 
work site and overhead times, impacts on explora- 
tion and science) indicate a real need for robotics. 
Robotics are needed to 

• Support diagnosis (test and inspection) for both 
EVA and IVA. 

• Assist the crew by transporting and positioning 
parts, sensors, and tools and possibly positioning 
the crew. 

• Perform robotic maintenance to minimize 
demands on crew, minimize backlogs of mainte- 
nance between crew visits (some maintenance 
will still require crew involvement), and free up 
the crew for science and exploration activities. 

• Perform dusting, servicing, etc. 

In addition to the maintenance actions 
described in this study, there will be other main- 
tenance actions, including dusting of sensitive 
surfaces and repairs to parts not characterized as 
LRU's. These may be infrequent but time con- 
suming. Maintenance of the rover, crew lander, 
and scientific instruments will also be required. 

The results presented here were the first 
simulation results of the FLO mission and have 
demonstrated the merit of early simulations to 
evaluate mission feasibility. As the mission 
definition changes because of these results and 


other considerations, additional simulations 
should be made. The iterations of simulations with 
mission designs early in the mission definition 
stages can be of significant impact in making the 
mission feasible. 

Requirements are characterized early when 
they can have the most benefit at least cost. FLO 
and all similar mission scenarios should adopt a 
design for reliability and maintainability early in 
the program. This design should include, as a 
minimum, consideration of provisions for the 
following: 

• EVA and IVA repair robotics 

• Full fault diagnosis, meaning design for 
diagnosis 

• Critical levels of redundancy 

• Commonality to the lowest level of design 

• Provisions for spares and logistics 

• Designforeaseof maintenance and repair 

• Adequate sensing and testing equipment 

• Tools and equipment for maintenance 

• Maintenance workstations 

• Crew availability and training 

• A knowledge support system 

4. Advanced Life Support System Robotics 

Neither of the above studies explicitly 
addressed the use of robotics to solve the problem 
of excessive crew time being required to operate 
various "subsystems," such as power, communi- 
cations, thermal control, and life support for a 
permanently manned outpost. We briefly address 
life support here. 

Since 1978 NASA has studied closed and 
controlled ecological life support systems (CELSS) 
or advanced life support systems (ALSS), which are 
bioregenerative and based on a combination of 
biological and physicochemical components that 
may be used on future missions in low-Earth orbit, 
in transit to other planetary bodies, and on lunar 
and planetary surfaces. Higher plants will be used 
in food production, water purification, carbon 
dioxide uptake, and oxygen release. 

Agriculture can be very labor intensive or 
assisted by automation (robotics). Operations of 
an ALSS such as crop seeding, nutrient solution 
maintenance, transplanting, plant observation. 
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harvesting, edible biomass separation, transport- 
ing, and preventative maintenance, if carried out 
by intelligent robotic systems, could greatly reduce 
the excessive crew time requirements to a reason- 
able level. Ten crops are apparently needed to 
supply nutrition needs. JSC is working toward a 
year-long, high fidelity test in a Human Rated Test 
Facility (HRTF) with four 90-day stays by a crew of 
four aided by automation and robotics. 

Experience from the Russian BIOS 3 experi- 
ments 1 9 indicated an average of greater than 4 hr 
per day for each crewmember was required to deal 
with food production. Biosphere 2 results indicat- 
ed an average between 2 and 3 hr per day for each 
of eight crewmembers was required to operate the 
food production aspects. Intelligent robotics can 
be used to reduce these times to an acceptable 
minimum for the HRTF 20. 


5. Usefulness of the Technology on Earth 

Space robotics, especially those systems being 
developed to operate on planetary surfaces, can be 
considered a form of the emergi ng technology of 
field robotics on Earth. The solutions to the 
problems we will be solving to make the explora- 
tion of our solar system possible and practical will 
apply to the many problems we have that require 
operating in hazardous environments on Earth and 
critically improving human productivity in many 
fields. Service industries can also use these devel- 
opments in relatively unstructured environments. 

Compared to the applications of space 
robotics in the Shuttle or on Space Station, the 
supervised autonomous robotics needed to make 
space exploration planet surface activities reliable 
and productive are closer to the capabilities 
required on Earth for many productivity improve- 
ments that raise the standard of living for every- 
one. The greatest benefit to the U.S. economy of 
any space exploration related technology can 
come from the development of supervised 
autonomous systems 2i . 


6. Conclusions 

Several conclusions are suggested by the 
results presented in this paper. One is that space 
exploration will be "enabled" by the use of super- 
vised intelligent systems on the planet surfaces, 
including supervised autonomous robotic systems. 
With sufficient use of intelligent systems, the num- 
ber and skills of humans on the planet surface will 


be determined predominantly by surface explora- 
tion, science, and local resource use (not outpost) 
objectives and requirements. Several other uses of 
intelligent systems in Earth orbit or during space 
transport are indicated by current studies. 

Additional modeling studies should be 
carried out to provide further results and insight. 
Our MSAT modeling tool makes these studies 
easier to do relatively quickly. 

Another conclusion is that more definitive 
requirements definition studies should be carried 
out for space exploration supervised intelligent 
(autonomous) robotic systems. 
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Abstract 

The design of a multitasking behavioral 
control system for the Robotic All Terrain 
Lunar Exploration Rover (RATLER) is 
described. The control system design 
attempts to ameliorate some of the 
problems noted by some researchers when 
implementing subsumption or behavioral 
control systems, particularly with regard to 
multiple processor systems and real-time 
operations. The architecture is designed to 
allow both synchronous and asynchronous 
operations between various behavior 
modules by taking advantage of intertask 
communications channels, and by 
implementing each behavior module and 
each interconnection node as a stand-alone 
task. The potential advantages of this 
approach over those previously described in 
the field are discussed. An implementation 
of the architecture is planned for a prototype 
Robotic All Terrain Lunar Exploration Rover 
(RATLER) currently under development, 
and is briefly described. 


Introduction 

One of the more interesting problems in the 
fields of robotics and artificial intelligence 
research is the autonomous navigation and 
control of ground vehicles. The capability 
for a vehicle to autonomously perceive, 
plan, and navigate through obstacle fields in 
realistic conditions has potential application 
in a wide variety of areas, from automated 
warehouse delivery carts to planetary 
exploration platforms. Since the advent of 


modern digital computers the typical 
approach to solving the autonomous 
navigation problem has been through 
Artificial Intelligence (Al) techniques, where 
a systematic procedure of perception, 
modeling, planning, action and feedback 
(see Figure 1) are applied in a manner 
reminiscent of human being's problem 
solving techniquesT2. 


sensors 



actuators 


Figure 1. Traditional Al Approach 

(from Brooks, 1986) 

The computational requirements for such 
systems have proven to be significant, and 
alternatives have been developed over the 
past few years which employ techniques 
that are less anthropomorphic, including 
neural networks, fuzzy logic, and behavioral 
control. Behavioral control was first 
proposed by Brooks^ as a simple and 
extensible architecture that could provide 
fully autonomous systems without the high 
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computational requirements of the 
traditional Al approach. The subsumption 
approach is based on an entirely different 
paradigm which is less 'sequential' In nature 
than traditional Al, and is more 'layered' 
(see Figure 2). 



Figure 2. Subsumption Approach 

(from Brooks, 1986) 


Some observations by researchers who 
have developed these behavioral or 
'subsumption' based systems have pointed 
out pitfalls in the subsumption paradigm that 
can make impiementation problematic'*. 
The use of muitiple processors that are 
physically wired together to produce the 
subsumption system resulted in a system 
that was mechanically complex, required 
custom designed hardware, and was 
somewhat prone to failure. Although these 
problems may be alleviated through the 
miniaturization and consolidation of the 
'macro-sized' multiprocessor system to a 
'micro' or even 'nano' sized custom silicon 
chip®, the basic weakness of these systems 
is their inability to provide internal models or 
representations of the environment. This 
lack of 'state' information makes interaction 
with the system by a user somewhat like 
attempting to converse with a very simple 
lifeform. It must be stated that the 
elimination of internal models or 'state' 
information was one of the objectives of the 
subsumption architecture's proponents, in 
that it is precisely this requirement for 
accurate world models that drives the 
traditional Al techniques' high computational 
requirements, and the argument has been 
made that an insect requires very little 
'computation' power to function 


autonomously®. While it is true that insects 
can function autonomously in a real world 
environment, it is difficult to Imagine a 
technique that allows one to interactively 
direct the behavior of an insect to suit one's 
purpose, and that is after all, one of the 
important criteria for robotic systems in the 
service of humankind. One potential 
approach that not only circumvents those 
problems but may actually facilitate the 
implementation of a subsumption 
architecture is to employ a multitasking 
framework as the basis for system design. 
This approach employs conventional 
computer processors to alleviate the 
mechanical complexity issue, allows 
software modules to operate independently 
in real time much as the subsumption 
architecture's 'behaviors' do, and provides 
many intrinsic features that facilitate the 
communication and interactions between 
independent modules. In addition, a reai- 
time multitasking system allows the use of 
traditional Al problem solving techniques in 
concert with a subsumption architecture, 
resulting in a hybrid system that takes 
advantage of the benefits of both 
approaches. The remainder of this paper is 
devoted to describing how such a system 
could be implemented, and discusses some 
concepts for specific implementation on a 
mobile robotic system under development 
at Sandia National Laboratories' Robotic 
Vehicle Range. It should be noted that 
detailed explanations of the subsumption 
approach are referenced at the end of this 
paper, and are therefore not covered here, 
however Dr. Brooks' notation is used herein 
to describe the use of the subsumption 
architecture's connections and features. 


Generalized Hybrid System Description 


Figure 3 illustrates a generalized hybrid 
system architecture for robotic system 
control, employing both a subsumption 
system (surrounded by a dashed line), and a 
conventional set of real-time inputVoutput 
tasks and associated data stores. Data 
acquisition from the sensor suite is handled 
by one or more specialized tasks, which 
typically run at a fixed rate. The fixed, 
regular rate of data sampling is an important 
characteristic of real-time systems, in that 
the application of some digital filtering or 
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post processing techniques often requires 
that data from several disparate sensors be 
acquired at fixed, repeatable intervals. One 
of the major advantages of a real-time 
multitasking system in this context is the 
ability to add, modify or delete major 
modules w/ithout affecting the timing 
characteristics of the system as a whole, or 
the iteration rate of other task modules in 
the system. As shown in the figure, data 
acquisition is completely independent of the 
subsequent tasks in the system that make 
use of it, as are the control output tasks. 



Figure 3. Multitasking Behavioral 
System Architecture 


By using a set of common data stores 
between the I/O tasks and the subsumption 
control system, the architecture allows for 
the addition of other 'high level' control or 
'Al-like' elements, as shown in Figure 4. For 
example, interfacing a set of tasks which 
perform perception and feature extraction, 
the maintenance of a data-based world 
model, and the planning of actions based on 
the model to the system is straightforward, 
and only affects the system at two points. 
The input data store must accommodate 
any new sensors required for perception, 
and the motion planner's output is hooked 
into one of the subsumption system's 
outputs via a new subsumption node. The 
world model data store is entirely 
independent of any other features of the 
system, and is only interfaced to the new 
tasks which require it. 



The previous example is included only to 
illustrate the method by which a 

subsumption system can be integrated with 
traditional Al approaches, and how a real- 
time multitasking architecture facilitates this. 
In an actual implementation, the addition of 
traditional Al techniques may be obviated if 
the subsumption system is fully 

implemented, resulting in a fully 
autonomous system which is capable of 
initiating motion, avoiding obstacles, 
pursuing some goal directed behavior, 

perception and\or reasoning about disparate 
sensory input, and exploration of its 

surrounding environment. 

As noted in Brooks’ description of the 
subsumption architecture, communications 
between each of the modules and the 
method of interconnecting the various 
modules through special nodes is the key to 
implementation^. The built-in features of 
most multitasking systems that allow 
intertask communication and 

synchronization provide the methods for 
implementing a subsumption system such 
that more than a single specific mechanism 
may be used. The most common feature 
available in multitasking systems is 
message passing. Message passing 
involves packaging data into a special data 
structure specific to the operating system, 
and tagging the data structure's 'destination' 



variable (or pointer) so that the multitasking 
kernel knows which task is the intended 
recipient. The kernel is invoked with a 
special call by the sender, and the message 
arrives at the destination task at some time 
in the future. In a real-time multitasking 
system, the message 'delivery' time is 
guaranteed to be no more than some 
specific interval. For the purposes of a 
subsumption system, the contents of the 
message would be the pertinent data from 
one module's 'output line' as described by 
Brooks, which corresponds to an 'input' line 
to either another module or to a 
subsumption node. In either case, the 
embodiment of behavioral modules and 
nodes as separate tasks makes them 
appear to be identical to the multitasking 
kernel, and are treated in exactly the same 
way by the system. Thus, message passing 
provides an immediate interconnection tool 
to create a subsumption system using tasks 
to implement behavior modules and 
connection nodes, and potentially allows 
both synchronous and asynchronous 
functionality, depending on the details of the 
multitasking kernel's features. 

An additional, complementary method for 
performing intertask communications 
involves the use of a user-defined data 
structure located in global memory (or 
accessed via a pointer) and a real-time 
multitasking feature called an 'event'. 
Events are implemented in a variety of 
ways across different kernels, but usually 
involve a special bitfield or kernel variable 
that is monitored by the kernel for changes. 
Changes in the state of the event bitfield are 
initiated by tasks that wish to record the fact 
that something of importance has occurred, 
and some other task needs to respond. The 
kernel notes a change in the event bitfield at 
some defined, minimum time after a task 
changed the state of the bit, and responds 
by triggering or 'kicking' the second task 
which is hooked to the event bit. Although 
somewhat clumsy when compared to 
message passing, the use of real-time 
'events' and an associated data structure 
operates much faster than the passing of a 
message. This is due to the fact that 
message passing usually involves much 
more in the way of data operations by the 
kernel than the simple setting or clearing of 
a single event. Either way, the intrinsic 


features of a real-time multitasking system 
provide the means by which synchronous 
and asynchronous interactions between 
independent tasks may be applied to 
implementing a subsumption architecture. 


Example Implementation for RATLER 

In the previous section some generalized 
descriptions of multitasking features were 
used to illustrate how a subsumption system 
could be implemented within such a 
framework. In this section, an example 
implementation for a specific mobile robot 
system is described. The example is taken 
from Brooks' description of a subsumption 
system^, and has been modified to fit the 
multitasking approach to implementation for 
the Robotic All Terrain Lunar Exploration 
Rover (RATLER)^. The RATLER II vehicle 
shown in Figure 5 is a four wheeled, skid 
steered articulated chassis design, with the 
body divided into right and left halves, two 
wheels on each side. The halves are joined 
together such that they may rotate along the 
lateral axis to enhance the mobility and 
stability of the platform. The RATLER is 
intended for use as a teleoperated vehicle, 
with semiautonomous navigation 
capabilities selectable by a remote operator. 
The RATLER will eventually carry a suite of 
instruments to perform mission-specific 
tasks, and will also be fitted with a multi- 
DOF manipulator arm. The primary mission 
of the RATLER is planetary exploration, but 
the platform may find terrestrial applications 
as well. 



Figure 5. RATLER II Prototype 


Although not yet implemented, some form 
of proximity sensing will be required on the 
vehicle to provide obstacle detection input 
for the software control system. This will 
most likely take the form of a series of 
infrared or microwave range sensors 
arrayed around the vehicle's periphery, 
which output a signal if an object is detected 
within a preset minimum range limit or set 
of range bins. 

The subsumption architecture as originally 
described by Brooks does not include any 
provisions for a user to interact with the 
system directly. Rather, the focus on 
minimalism to achieve autonomy ignores 
the possibility of teleoperation as a control 
mode. For certain applications, i.e. the 
exploration of a planetary surface too far 
removed from a human teleoperator for 
real-time remote control to be feasible, a 
strictly autonomous operations mode makes 
sense. However, there are many other 
applications of robotic vehicles that would 
benefit from a capability to teleoperate, 
including both terrestrial and possibly lunar 
surface missions^’^. Two implementation 
concepts for a subsumption system with a 
teleoperation capability are briefly described 
below. The first assumes teleoperation to 
be the 'lowest' functional layer in the 
subsumption architecture, whereas the 
second concept assumes teleoperation to 
operate at the 'highest' level. Precisely 
which of the two approaches, or some other 
as yet undefined approach, is best will the 
subject of future work. 


Teleoperation as the 'lowest' level: 

As illustrated in Figure 6, the subsumption 
system proposed for the RATLER is based 
on the real-time multitasking approach 
described previously. A set of dedicated 
tasks perform data acquisition from a sensor 
suite, and run at a fixed, periodic rate 
between 10 and 100 Hz. The timing interval 
is controlled by the multitasking kernel via 
inherent timer interrupts, and is transparent 
to the user. The sensor data is stored in a 
common memory structure and is available 
to the subsumption system's modules as 
required. Each module accesses only the 
data it needs in order to perform its function, 
which is to operate as a finite state 


machine. The data store is accessed via 
pointers, and does not require any event 
bits or message passing. Note that this 
system differs from Brooks' original 
description in one significant regard; that is, 
the lowest functional element is a module 
called 'teleoperate'. This is assumed to be 
the lowest possible level of functionality, 
considering a teleoperation system to be of 
a lower order than a self guided or 
autonomous system. 



Figure 6. Teleoperation as 'lowest' level 


As the lowest level of functionality in the 
subsumption system, teleoperation is 
always active if commands from the user 
(located at a remote control console) are 
arriving and are being stored in the data 
structure. If any of the 'higher* modules 
illustrated in Figure 6 are active, their 
outputs will subsume the teleoperator's 
outputs in a manner identical to that 
described for Brooks' subsumption 
architecture. This simple change from the 
original system description allows a direct 
user interface for teleoperation, and does 
not adversely affect the operation of the 
overall subsumption design. The higher 
subsumption modules may be switched 'on' 
or 'off by the user via a set of flags in the 
data structure, so that the system can be 
operated in either a straight teleoperation 
mode, or in any combination of autonomous 
modes, depending on which behaviors are 
switched on and which are switched off. 
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Each behavior module is an independent 
task which runs at a specific periodic rate, 
typically between 10 and 100 Hz, and 
communicates with other modules via either 
the event bit method or message passing 
method as previously described. The 
subsumption nodes are designated as 
circles with an 'S' in the figure, and are also 
implemented as independent tasks which 
run at fixed periodic rates. In order to 
perform their function as subsumption 
nodes, they must maintain an internal 
record of the latest inputs from their 
associated behaviors, and perform a finite 
state machine transition based on their 
simple set of internal rules. The result of 
the machine state switch determines what 
the node's output signal will be, and that is 
sent via either a message or event bit signal 
to the next behavior or node as illustrated 
above. After the signals have passed 
through the subsumption system and have 
arrived at the output data store, placed 
there via pointer access, a set of dedicated 
output control tasks access the data 
structure for setpoints and translate them 
into output signals to the robot's hardware. 
Functionally, this system is no different than 
more conventional implementations of the 
subsumption architecture, except it relies on 
the inherent facilities of a real-time 
multitasking kernel to perform the 
communications required between modules 
and nodes, and to control the real-time 
preemptive triggering of those modules 
according to a real-time clock. 


Teleoperation as the 'highest' level: 

In contrast to the concept previously 
described for a teleoperation capability in a 
subsumption system, the assumption of 
teleoperation as occupying the 'highest' 
level in the subsumption system leads to a 
very different implementation scheme. The 
system shown in Figure 7 is the basic 
multitasking subsumption architecture 
described in earlier sections of this paper, 
with the addition of a teleoperation module 
outside the subsumption subsystem. The 
teleoperation module receives its inputs 
from a remote control station where a 
human operator is located, whose 
commands are in the form of motorVaxis 
setpoints in position, velocity, or torque. 


Global commands in the form of a desired 
vehicle heading or destination waypoint may 
also be generated from the remote console. 



Figure 7. Teleoperation as 'highest' level 


The desired setpoints or global state 
commands received by the teleoperation 
task are used to generate a set of synthetic 
alternate signal inputs from appropriate 
sensors, which are then placed in the input 
data store. They may simply be used to 
modify real sensor data, or to replace the 
real data altogether much in the same 
fashion as the subsumption subsystem 
operates on its data flows. The act of 
teleoperating then becomes a matter of 
'faking out' the subsumption system by 
providing it input signals that will cause the 
subsumption system to react in the desired 
manner. A simple example would be to 
cause the vehicle to move forward, the 
generation of a synthetic obstacle signal 
from one of the rearward facing obstacle 
sensors by the teleoperate task should 
cause the 'runaway' behavior to activate, 
and the subsumption system would then 
move the vehicle away from the offending 
signal, in other words, forward. This 
approach may have advantages over the 
previously described 'approach, in that it 
relies on the subsumption system to perform 
all control outputs and is implemented 
outside of the subsumption subsystem, 
whereas the 'teleoperation is lowest' method 
requires the teleoperation task to be an 


integrated part of the subsumption 
architecture. 


Summary 

This paper has described an approach to 
implementing the so-called 'subsumption' or 
'behavioral' control scheme for robotic 
systems within the framework of a real-time 
multitasking architecture. The potential 
advantages of this system result from taking 
advantage of timing and communication 
features available with most multitasking 
kernels. The proposed architecture also 
allows for the construction of hybrid systems 
which employ both subsumption and 
traditional Al techniques, and easily 
provides for a teleoperator's interface as 
well. The proposed system is well suited to 
operating on conventional general purpose 
computing hardware, and should allow 
development with existing software tools 
designed for real-time multitasking systems. 


Future Work 

Although only two multitasking features 
were described, event scheduling and 
message passing, there are undoubtedly 
other inherent multitasking system features 
that may be employed to some advantage. 
The benefits of employing teleoperation as 
either the 'highest', 'lowest' or some 
intermediate level with respect to the 
subsumption system remains to be fully 
evaluated. Pursuing those and other issues 
will be part of future efforts by Sandia 
National Laboratories in the development of 
the RATLER control system software. 
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1 Introduction 

During the last few years, there has been a growing 
interest in the use of active conuol of image formation 
to simplify and accelerate scene understanding. Basic 
ideas which were suggested by [Bajesy 88] and 
[Aloimonos et al. 88] has been extended by several 
groups including [Ballard 91], and [Eklundh 92]. This 
trend has grown from several observations. For 
example, Aloimonos and others observed that vision 
cannot be performed in isolation. Vision should serve a 
purpose, and in particular should permit an agent to 
perceive its environment. This leads to a view of a 
vision system which operates continuously and which 
must furnish results within a fixed delay. Rather than 
obtain a maximum of information from any one 
image, the camera is an active sensor giving signals 
which provide only limited information about the 
scene. 

Bajesy observed that many traditional vision problems, 
such as stereo matching, could be solved with low 
complexity algorithms by using controlled sensor 
motion. Examples of such processes were presented by 
Krotkov [Krotkov 90]. Ballard and Brown [Brown 90] 
demonstrated this principle for the case of stereo 
matching by restricting matching to a short range of 
disparities close to zero, and then varying the camera 
vergence angles. The development of robotic camera 
heads has lead to the possibility of exploiting 
conu'ollcd sensor motion and control of processing to 
construct continuously operating real time vision 
systems. 

At the same time, research in applying artificial 
intelligence techniques to machine vision led to an 
emphasis on the use of declarative knowledge to 
control the perceptual process. Systems such as the 
Schema System [Draper et. al. 89] developed a black- 
board architecture in which multiple independent 
knowledge sources attempted to segment and interpret 
an image. A major problem in such systems is control 
of perception. Such systems emphasise explicit 
representation of goals and goal directed processing 
which direct the focus of attention to accomplish 
system tasks. It has not been obvious how such a 
knowledge based approach to control of attention could 
be married to a real time continuously operating 
system. 
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In July 1989, the European Commission funded a 
consortium of six laboratories to investigate control of 
perception in a continuously operating vision system^. 
The consortium partners set out to build a test-bed 
vision system for experiments in control and 
integration. An experimental test-bed system was 
constructed which integrates a 12 axis robotic stereo 
camera head mounted on a mobile robot, dedicated 
computer boards for real-time image acquisition and 
processing, and a distributed system for image 
description. The distributed system includes 
independent modules for 2-D yacking and description, 
3-D reconsyuction, object recognition, and conyol. On 
March 18 1992, a fully integrated continuously 
operating vision system was demonstrated to the 
European Commission using this test-bed. This paper 
reports on the development of this system and the 
research which the system makes possible in conyol of 
a real-time vision system. A more complete description 
of the results of the project may be found in the book 
[Crowley-Christensen 93]. 

1.1 The Project Vision as Process 

The sutrting point for the project “Vision as Process” 
was the demonsyation of an integrated vision system 
capable of continuous real-time operation. It was 
quickly realised that such an ambition raises two 
problems: 

1) The technical problem of integrating processes 
which model the environment in terms of 
descriptions which arc qualitatively different. 

2) The problem of controlling the “attention” and 
processing of a continuously operating system. 

Concerning the first problem, different robotic tasks 
require different kinds of descriptions of a scene. Such 
descriptions can include 2D image description, 3D 
scene descriptions and symbolic labelling of the 
components of a scene. Such processes are 
complementary and mutually supportive. A framework 
is required which would permit the integration of 
multiple vision processes. This can be considered an 
“engineering” problem. 


^Thc Partners in project ESPRIT "Vision as Process" 
are Aalborg University (DK), University of Surrey 
(UK), KTH, The Royal Institute of Technology (S), 
University of Linkoping (S), University of Genoa (I), 
LTIRF-INPG (F) and LIFIA-INPG (F). 


Copyright© 1993 by James L. Crowley and Henrik 1. Christensen. Published by the American Institute of Aeronautics 
and Astronautics, Inc, with permission. 
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The second problem is bolh subtle and fundamcnial. 
Most of the algorithms used in vision have 
computational costs which depend on the quantity of 
data. In the best cases the relation is linear, but in 
many cases it is quadratic, cubic, or even exponential. 
Real time response requires that llie processing time for 
any part of the system is limited. This requires that the 
amount of data considered during each processing cycle 
be bounded which raises the problem of which subset 
of the available data the system should attend during 
each cycle. This is part of the larger problem of 
controlling perception. General purpose real time 
vision system requires a solution to this problem. 

These observations let the consortium to develop a 
long term work plan with both an engineering and a 
scientific goal. These arc: 

En gineering Goal iDevelop techniques for integrating 
cyclic real time processes for description of a 
scene in terms of 2D images, 3D structure and 
labelled objects using active control of a camera 
head. 

Scientific Goal : Develop methods (and a theory) of 
control of attention in perceptual processes. 

With these twin goals, the consortium has developed a 
long term plan leading to the demonstration of methods 
for the construction of integrated continuously 
operating vision systems, and the elaboration of a 
theory for the control of such systems. 

1.2 System Integration and Control 

When the VAP project was conceived in 1988, only a 
small number of vision systems were capable of 
performing symbolic interpretation, and they were 
designed for interpretation of single (static) images. 
The well known examples included VISIONS [Hanson- 
Riseman 78], ACRONYM [Brooks 81], and 3DPO 
[Bolles-Horaud 84]. 

Most work on analysis of image sequences had been 
carried out on pre-recorded images and the level of 
description was almost entirely parametric, i.e,, 
systems could describe regions or features with 
independent motion in terms of their image or 3D- 
velocity. A review of the state of the art is provided by 
Huang [Huang 83]. Continuous and real-time 
observation of a dynamically changing scene involves 
more than motion interpretation. A continuously 
operating vision process must be able to limit 
processing to a small subset of the data available from 
visual sensors, and to adapt its processing mode 
dynamically in response to events in the scene and 
requirements of the task. 

1.3 The VAP Hypotheses 

From its earliest meetings, the VAP consortium agreed 
that vision should be studied in the context of its 
purpose, i.e. its use by other processes. Without 
dedicating to any specific application, this implies that 
visual processing can be controlled to concentrate on 


the subset of visual information which is considered 
relevant to the current goal as defined by a user process. 
In addition, the consortium recognised the ability to 
exploit coherence in the dynamic evolution in a scene. 
In a continuously operating system, temporal context 
permits changes in the scene to be predicted and 
computational resources to be directed to confirm 
expectations. This implies that tracking is basic 
operation within a continuously operating system. 

The goal of the VAP project is to demonstrate that a 
vision system must be designed as a continuously 
operating “process”. To demonstrate this principle, the 
consortium has designed a six-year research program to 
develop techniques to interpret a dynamically changing, 
quasi-su^uctured environment. These techniques exploit 
goal directed focus of attention involving controlled 
sensor motion and control of processing. Processing is 
directed by goals which change dynamically in reaction 
to the needs of the perceptual tasks and to events in the 
scene. 

The following section reviews some previous 
approaches to integration and control. This previous 
work establishes the set of concepts and “prior art” 
from which the design of the VAP skeleton system 
draws. 

2 Systems Architectures for 
Integration and Control 

A suitable system architecture is required for 
experiments in integration and control of a 
continuously operating system. In this chapter, we 
review previous approaches to architectures of vision 
systems. From this review, we argue that flexible 
integration may be achieved through use of a standard 
module architecture, replicated at each of the levels in 
the system. Such a standard module architecture is 
described in more detail in chapter 3. 

2,1 The Reconstruction Approach 

A popular approach for structuring a vision system hks 
been proposed by Marr [Marr 82]. Marr argues for a 
system defined around a hierarchy of representations: 
images, The primal sketch, the 2.5-D sketch (viewer 
centred deptli map), 3-D map and symbolic description. 
In this model, processing is organised as sequential 
processes in which information flows up through the 
levels. Processing is data-driven, in the sense that 
recognition and description are based on descriptions 
constructed at the lower levels in the system. This 
model is computationally demanding and it has proven 
difficult (if not impossible) to provide image 
descriptors that arc sufficiently robust to allow 
characterisation of all phenomena in a natural 
environment. 

The Marr processing model may be termed a 
reconsiruciion approach as it aims at a full 
reconstruction of the environment. The processing 
model is purely data driven, and thus poses a problem 
in terms of computational resources. The Marr model 
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assumes all processing may be carried out as a 
sequential process. This implies that a module uses the 
reprcseniaiion(s) at the level just below as a basis for 
its processing and the result is stored in the next higher 
representational level. The interfaces arc consequently 
well defined. A simplified model of the processing is 
shown in figure 1 , 


add new modules to the system. The common 
blackboard is a potential problem for a continuously 
operating system. All information generated and used 
by the system passes through the blackboard. It might 
thus pose a problem with respect to information band- 
width. To indicate the amount of information, which 
may be generated directly from images one may look at 
the Image Understanding Architecture, described by 



Fig. 1 A processing model for the reconsu-uclion approach. 


In terms of representations, this processing model 
implies that information needed to perform recognition 
and interpretation of settings must be available as part 
of the 3D model, i.e., a diverse set of descriptors must 
be tagged onto the 3-D model representation to 
facilitate recognition and description. This duplication 
of information up through the system and the 
unavailability of pixel level primitives can pose a 
problem in terms of model size and maintenance over 
time. 

2.2 The Non-Committal Approach 

In the VISIONS system [Hanson-Riseman 1978] daui 
are stored on a blackboard, or common storage area and 
processing is performed in parallel by a number of 
“knowledge sources*’. All modules in the system can 
access information at any of the representational levels. 
This implies that information does not have to be 
replicated up through the system to be made available 
for recognition procedures. A simplified model of the 
non-committal approach is illustrated in figure 2. 



Fig. 2. Architecture for the non-committal approach. 

In this architecture, each of the modules in the system 
has control information that specify the information 
that must be available before the module may carry out 
its task. In addition, it has control information that 
specifies the information which may be provided by die 
module. Through use of a control executive, it is 
possible to perform both goal directed and data driven 
processing through use of this control information. 

This processing model imposes few constraints on the 
representations used in the systems and it is simple to 


Weems et al. [Weems 1989]. In that system, the 
storage reserved for intermediate representations is 4 
GB and the system is only aimed at analysis of single 
images. Such an architecture will require extensive use 
of special purpose hardwaie, in particular when applied 
in the temporal context. 

2.3 The Purposive Approach 

Introduction of goal directed operation and use in a 
limited and well defined domain of application allow 
synthesis of a vision system which is composed of a 
set of specifically engineered modules. Such modules 
may be designed to be computationally well behaved, 
in the sense that the computational complexity is 
bounded and often robust representations can be 
provided. This approach to construction of vision 
system has been promoted by Bajesy [Bajesy 88], 
Ballard [Ballard 91] and [Aloimonos et al. 88]. 
Although the approach exploits specific vision 
modules, Bajesy has tried to enumerate a set of 
modules that might form a general purpose system. 
The use of dedicated modules is a way to provide robust 
information and computationally tractable techniques. 
Well known examples of dedicated modules used for 
robot navigation are optical flow modules that can 
compute the position of the focus of expansion for the 
optical flow field, and modules which can compute the 
time to contact from motion in an image sequence. 

This approach to system construction is termed the 
purposive or the animate approach. It is envisaged that 
the construction and analysis of specific modules will 
gradually provide insight that will allow definition of 
modules applicable in general vision systems. The 
convergence towards a standard set of modules through 
analysis of diverse application domains might provide 
valuable insight, but it is not obvious that convergence 
will be achievable. 

In the purposive approach, the exploitation of 
information is task driven and may be very different 
from one task to the next. The basic system 
architecture should thus be flexible and facilitate 
dynamic change of the information flow. In practical 
systems a number of modules may exploit the same 
representation and once a system has been defined an 
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analysis of the representational requirements may point 
to the definition of a set of standard representations. 
Given present state of the art no such general 
representations are known. 

A purely purposive approach to vision rejects most of 
the established techniques. Modules are to be built 
from scratch whenever a new type of 
information/representation is required. There is 
consequently a concern that from this approach little 
insight will be gained in terms the general vision 
problem. 

2.4 The Vision as Process Architecture 

During the first year, the consortium “Vision as 
Process” addressed the problem of design of an 
architecture which would meet the following criteria: 

1) Continuously operating. 

2) Integrating software contributions from 
geographically dispersed laboratories 

3) Integrating description of the environment with 
2D measurements, 3D models and recognition of 
objects. 

4) Capable of supporting diverse experiments in 
gaze control, visual servoing, navigation and 
object surveillance. 

5) Dynamically reconfigurable as the task changes. 

The result was the design of a distributed test-bed 
system composed of independent modules. Modules 
may communicate by message passing over a central 
message server, or by dedicated “high-band width” 
channels. Systems can be composed from sub-sets of 
the available modules for individual experiments. 


a more flexible processing model was needed to make 
computations both efficient and robust. 
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Fig. 3. The VAP system architecture 

Having selected a distributed architecture composed of 
modules, the consortium turned its attention to the 
design of a common component for each module. At 
the very least, this standard module must provide the 
communications interface. It was soon observed that 
scheduling was a basic to continuous operation and a 
cyclic scheduler was provided which calls the 
procedures which implement each phase of 
computation. The phases of operation included phases 
for integration of new data and phases for conuol of 
processing. 


The architecture adopted by die consortium is shown in 
figure 3. The system has a data flow part, which is 
similar to the Marr processing model. It should be 
noted that the data flow is not only bottom-up but may 
also be top-down. Top-down expectations (derived from 
the present set of goals and contextual information) can 
be used to direci/control processing at lower levels, 
while detected event at the same time can drive a 
reconstructive mode of processing. The VAP 
architecture contains a common communication 
channel that allow communication between any pair of 
modules in the system. This communication channel 
may be used both for investigation of a non-commituil 
processing model, and for investigation of purposive 
systems, as the component processes in the system in 
figure 3 may either general purpose or dedicated. 

This architecture imposes few constraints on the design 
of component modules and it provides flexibility for 
the investigation of system level control issues. 
Initially it was envisaged that the main flow of data 
would exploit the communication links between 
adjacent modules, while only control information 
would be communicated through the common channel. 
During the execution of the project it was realised that 


In order to obtain temporal context, the consortium 
drew on previous results in image tracking. A tracking 
architecture was defined composed of the phases prcdict- 
match- update [Crowley et al. 88], [Gran urn -Christensen 
88] based on techniques used in the control community 
since the early sixties [Kalman 60]. The architecture is 
shown in figure 4. 

The analysis block, in figure 4, is responsible for the 
frame by frame analysis, which generated a set of 
geometric primitives (or tokens). The correspondence 
with information in the temporal context is performed 
in the match block. To simplify matching the 
information in the temporal context is used in a 
prediction of the expected content of the next frame. 
Once correspondence has been established the 
information confined in the internal models must be 
updated to reflect the new information contained in the 
new frame. Once updating has been carried out the 
cycle may start all over again. 
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Figure 4. Basic Prcdict-Maich-Updalc cycle for tlic module architcclure. 


As a model at level N+1 is used for prediction of 
primitives in the next frame, the predictor may also be 
given other types of input which can be used for 
guidance of processing. Introduction of goal derived 
information into the model at level N+1 will 
consequently allow top-down/attention based control of 
processing. A prediction may be transformed into a 
representation that is compatible with the one used the 
level below, so that it may drive processing at the level 
below. This flexibility facilitates investigation of 
different control strategics. 

2.5 Control Issues 

Construction of an operational system includes issues 
in control to ensure satisfaction of user defined goals. 
Goals are widely recognised as a fundamental 
component of intelligent systems. The consortium 
initially defined a set of general goal commands : 

search(X): Is X present in the scene? 

find(X): Where is X, given X has been 

identified earlier? 

rcIatc(X,Y): What is the spatial relation between X 
and Y? 

describc(X,Y): Detcnnine property Y for object X 

watch(X): Allocate resources for notification of 

changes for X 

track(X): Maintain a dynamic description of 

object X. 

This set of goals defines the user level interface to the 
system. Based on the potential goals, the system must 
be able to allocate its resources for optimal satisfaction 
of the concurrent goal(s). 

A number of approach to goal directed processing have 
been reported in the vision literature. Most of these 
efforts include use of a cyclic process, in which data 
received are matched against expectations. Depending 
on success or failure in the matching process an 
updating or event detection process is used to drive the 
next cycle of processing. An example of such a cyclic 
process is described by [Tsotsos 87] for the ALVEN 
system. 

When the VAP effort was initiated the use of 
production systems and reasoning under uncertainty 


appeared the most promising in terms of providing 
insight into the problem of the cycle of planning, 
sensing and interpretation. These tools have been 
incorporated into the system. In system level control, 
externally defined goal commands arc translated into 
actions by rule based planning. Planning generates the 
sequence of state transitions (actions and their 
parameters) expected to allow completion of a goal. 
These actions arc then executed by one or more system 
modules. The internal handling of such actions is an 
issue that is resolved for each of the modules. A 
skeleton system constructed by the consortium 
provides the framework for experiments in control and 
coordination of visual processes. 

3. The SAVA III Skeleton System 

In order to perform experiments in control and 
integration of a continuously operating vision system, 
the VAP consortium constructed an empty “skeleton” 
system. This skeleton was then provided to partners so 
that they could “fill in” the functional parts needed for 
their experiments^. This system was named SAVA, 
for the french acronym "Squclette d’ Application pour la 
Vision Active'*. The SAVA Skeleton system provides 
a standard module with communication and interface 
components that permit an experimenter to construct 
and run distributed real time vision system. 

The structure of SAVA has evolved with our 
understanding of the problems of integration and 
control. The original SAVA system was released at 
month 12 of VAP-I. Experiences during the second 
year of VAP-I brought out a number of shortcomings 
in the design. A team composed of people from AUC, 
KTH and LIBIA designed a revised version, SAVA II, 
which was released at the month 24 milestone. An 
intense integration effort was performed in preparation 
of the month 33 integrated demonstration, with 
software and hardware contributions from all VAP 
partners integrated into SAVA II. Modifications in 
communications and interface design, as well as a large 
number of small improvements, led to release of 
SAVA 2.4 in March 1992. 

Experience with SAVA II has shown the importance of 
demons for combining purposive and event driven 
control of perception. This led to the desire for an 
interpreter for demon functions. In addition. 


^Thc incompatibility of successive releases of MOTIF 
have created problems for portability and have cost the 
consortium considerable time and money. 
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programming control experiments in SAVA II was a 
somewhat difficult task. Control knowledge was 
embedded in procedural code and thus hard to understand 
or change. It was decided to design a control system 
based on an interpreter for declarative expressions of the 
control logic. From these two needs emerged the idea 
of using the CLIPS 5.1 rule interpreter for the control 
component and the demon interpreter within each 
module. CLIPS 5,1 is written in C and is provided 
with the full source code. As a result, it was extremely 
easy to integrate the SAVA modules into the CLIPS 
environment. 

A new version of the skeleton system, SAVA III, has 
been created based on the principle of interpreting 
control information. In SAVA III, most of the 
procedures for processing and communication arc 
written as C procedures and explicitly declared to the 
CLIPS 5.1 rule interpreter. Rules and functions arc 
then written using these procedures. The basic 
processing cycle is built as a sequence of states with 
transitions managed by rules. The processing performed 
within a state can be easily changed based on cither 
perceptual events or external commands. Because the 
control rules are interpreted, the control sequence for a 
module may be changed dynamically, without re- 
compiling a module. It is even possible for a module 
to send another module function definitions as ASCII 
messages, using the CLIPS deffunction facility. The 
rule based scheduler is particularly useful for the 
implementation of demons. Demons may be 
programmed as rules which react to the contents of the 
model as well as to external messages. 

In addition to the changes in ilie control part, a major 
effort has been made to add the possibility of 
synchronised operation to the modules. SAVA is a soft 
real-time system, distributed over a set of workstations 
operating under UNIX. At some time in the future, wc 
intend to port SAVA onto dedicated hardware running 
under a real time programming environment. However 
such systems are relatively difficult to program and 
debug. The use of UNIX and distributed processing 
permits the VAP-II project to perform experiments 
with a reasonable effort. 

A synchronisation system has been built into SAVA 
III in order to compensate for uncertainties in 
communication and execution time for distributed 
modules. A synchronisation module provides other 
modules with a universal time reference. In this way, 
all information that is processed or communicated is 
time-stamped, permitting an estimate of dynamic 
processes to be observed orconu-olled. 

The following sections present a detailed description of 
the components of the SAVA III system. It first gives 
a brief overview of the components of the skeleton 
system and its standard module. It then describes 
processes for interpreting messages using a rule based 
interpreter, and the design of “demon” processes that 
perform pre-attentive detection of events. A description 
of the rule based control of a module is presented, 
followed by a description of the synchronisation of 
modules. 


3.1 Overview of the SAVA III 
Software Skeleton 

The SAVA skeleton system is composed of the 
following components: 

1) A launcher program that permits the user to 
assign modules to processors and to initiate 
operation. 

2) A distributed mailbox system that is launched on 
the different processors to establish a 
communications system and to launch the 
component processes. 

3) A library of gjjmjUijnkiLliQn. procedures for 
modules. This library include procedures for 
communication by message as well as procedures 
for dedicated high band-width communication 
between processes. 

4) A skeleton module structure built around a 
scheduler. 

5) A set of graphical man-machine interfaces. 

The SAVA system provides mailbox communication 
for data, control and acknowledgements, as well as a 
procedures for dedicated high-band-width channels 
between modules. Messages include formatting 
information that permits the message passing system 
to pack and unpack messages. 

Visual perception is performed within processes 
imbedded in copies of the SAVA “standard module”. 
The SAVA III standard module is shown in Figure 5. 
The standard module is composed of a number of 
procedures (shown as rectangles) that are called in 
sequence by a scheduling process. 

A SAVA module repeatedly executes a cycle in which 
it: 

1) Acquires new data. 

2) Transforms this data into an internal 
representation. 

3) Mtikes predictions from its internal model. 

4) Matches the predictions with the transformed data, 

5) Uses the match results to update the internal 
model, 

6) Executes demons to detect perceptual events 
witliin tlie internal model. 

This cyclic process is executed by a rule-interpreter. 
Each phase corresponds to a state in which a particular 
operation is performed. At each state transition new 
messages which have arrived on the mail box channel 
arc read and processed. Such messages may change the 
procedures that arc used in the process, change the 
parameters that are used by the procedures, or 
interrogate the current contents of the description that 
is being maintained. 
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The cyclic process within a module is managed by a 
control token placed on the working memory. This 
control token is a simple list in which the first atom is 
the word “phase” and the second is the name of one of 
the phases: (get-data, transform, predict, match, update, 
messages, demons). The definition of theses phases is 
as follows: 

get-data Acquire a new observation 

transform Transform the data to the internal 
representation 

predict Predict the contents of the observation 

match Match the prediction to the observation 

update Update the model using the correspondence 
of the prediction and the observation 

demons Execute a set of automatic procedures for 
event detection. 

At the end of each cycle, die scheduler executes a set of 
demons. Demons arc responsible for event detection, 
and play a critical role in the control of reasoning. 
Some of the demon procedures, such as motion 
detection, operate by default, and may be explicitly 
disabled. Most of the demons, however, arc specifically 
designed for detecting certain types of structures. These 
demons are armed or disarmed by recognition 
procedures in the interpretation module according to the 
current interpretation context. 


In the SAVA III system, the procedures of a module 
arc made explicitly available to the CLIPS 5,1 rule 
based interpreter. This includes the original SAVA II 
scheduler, so that the system is upwards compatible. In 
addition to acting as a scheduler, the rule interpreter is 
also used to define the control part of demons and to 
interpret messages from other modules. 

3.2 Communications Between Modules 

Modules communicate control, data requests, reply and 
synchronisation information using message passing 
based on Unix Sockets. The SavaSend() function is 
used to send mailbox messages to other modules. A 
SavaSend command contains three fields: 

Header: The destination and type of message. 

Fonnat: An ASCII description of the message format. 

Body: The message including commands and 

parameters. 

The destination is a symbolic name for another 
module. The types of message may be control, 
acknowledge or data. All message exchanges are 
initiated by a control message. The format string is 
transmitted with the message and is used both to 
encode and decode the message. In this way a change in 
message protocol may be made with a minimum of 
difficulty. This format string can contain conversion 
directives like %d, %f, and %c, based on the C 
language printf protocol. We have added conversion 
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direclives for sending arrays, structures and images. 

Many SAVA functions accept a variable number of 
arguments. Furthermore, the type of these arguments is 
unspecified. These functions accept a fixed number of 
normal arguments, followed by an arbitrary number of 
arguments of unknown type. The last normal argument 
is a format string which describes the arguments 
following it. 

Large data structures may be communicated between 
modules using dedicated sockets. Communication of 
dedicated channels are performed by the functions 
SavaRead and SavaWrite. As with mail-box, 
messages, high band-width channel messages are 
encoded with an ASCII format directive which is 
transmitted with the message. High band-width 
channels in SAVA are faster than message passing 
because the channels provide a direct connection. No 
intermediate routing is necessary. 

Messages passed through the mail box communication 
system are interpreted by the rule interpreter. New 
messages are transformed into working memory 
elements by the function “checkmessage**. 
Checkmessage creates a list in working memory 
composed of the keyword “message” followed by the 
name of the sender, a message keyword and and ASCII 
string with the body of the message. Checkmessage 
assures upwards compatibility with message types that 
were defined in SAVA II which have not be 
transformed to SAVA III. 

The checkmessage function is executed at the end of 
each phase of the standard module. For example, the 
transition from match to update is performed by the 
rule “update-phase” : 

(defrule update-phase 

(declare (salience -100) ) 

?p <- (phase match ?c) 

=> 

(check-message) 

(retract ?p) 

(assert (phase update ?c) ) 

) 

The result of check-message is a list of the form: 

(message ?sender ?command ?body) 

If ?command string is tested to determine how the 
message should be interpreted. If command corresponds 
to one of the CLIPS keywords “deffunction” or 
“defrule”, then ?body is interpreted by the CLIPS 
function BUILD. This is permits external modules to 
define functions and rules using the CLIPS deffunction 
and defrule constructs. 

If ?command corresponds to a previously defined 
function, then ?body is executed using the clips “eval” 
construct. If ?command is unknown, or the 
interpretation is not successful, eval returns FALSE. 
The result returned by eval is used by the function 
“reply” to send a reply to the sender. If the message 
evaluates to a “NIL”, then reply docs not send a 


message. 

The CLIPS function “build” will interpret a string as if 
it has been typed to the interpreter. This may be used 
to interpret defrule and deffunction messages from other 
modules, as shown by the rule “interpret-def- 
comman(Js”. The command “mv-append” is used to 
compose a list with the desired commands. 

(defrule interpret-def-commands 
(declare (salience 100)) 

?m <- (message ?sender ?command ?body) 
(test (member ?command 

(mv-append deffunction defrule))) 

=> 

(reply ?sender (build ?body) ) 

(retract ?m) 

) 

Functions may be defined at initialisation or by 
messages from other modules. A function, encoded in 
an ASCII string, may be executed using the CLIPS 
“eval” command, as shown by the rule “interpret- 
function-messages" 

(defrule interpret- function-message 
(declare (salience 100) ) 

?m <- (message ?sender ?command ?body) 
(test (member ?command 

(mv-append list-deffunctions) ) 

=> 

(reply ?sender (eval ?body) ) 

(remove ?m) 

) 

If the interpretation is not successful, eval returns 
FALSE. Unless the result is NIL, it is sent to the 
?scndcr in a reply message. 

3.3 Automatic Interpretation by Demon 
Processes 

A demon is an automatic procedure which operates on 
the internal model of each module to detect events. 
Currently active demon procedures are executed after the 
update phase of each cycle. Demons are responsible for 
event detection, and play a critical role in the control of 
reasoning. Some of the demon procedures, such as 
motion detection, operate by default, and may be 
explicitly disabled. Most of the demons, however, are 
specifically designed for detecting certain types of 
structures. 

Demons may be invoked by other demons or by 
commands received from other modules, including from 
a human supervisor. A demon is instantiated by 
entering a demon token in working memory. A demon 
token is simply a list with three elements: 

(demon <name> <id>) 

where <name> is the name of the demon and <id> is a 
unique identity determined by the function “gensym”. 
Multiple copies of the same demon can be instantiated, 
each having its own “id”. Each demon can create its 
own state in working memory, indexed by <id>. A 
demon can be removed by removing the demon token 
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from working memory. 

The control part of a demon is encoded as rules. As an 
example consider a demon to find ellipses in the image: 

(defrule ellipse-f inder "The demon for an 
ellipse finder" 

(phase demons) 

(demon ellipse-f inder ?id) 
(ellipse-demon-data (id ?id) 
(parameters ?p) ) 

=> 

(assert (get-ellipses ?p) ) ) 

If we suppose that the function “get-ellipses” will 
instantiate a structure of type ellipse for each ellipse 
found, then a second rule can be written to treat each 
ellipse. 

(deffrule hypothesize-cylinder "generate 
cylinder hypotheses" 

(phase demons) 

(demon cylinder-finder) 

(ellipse (id ?id) (cx Vx) (cy ?y) 

(major ?ma) (minor ?mi) 

(angle ?angle) ) 

(test (< 5 (abs ?angle) ) ) 

=> 

(assert (cylinder (cx ?x) (bottom ?y) 
(radius ?ma) (ellipse ?id) ) ) 
(assert (cylinder (cx ?x) (top ?y) 
(radius ?ma) (ellipse ?id)))) 

Other rules can be used to detect the existence of 
cylinders with the same axis and to reduce cylinder 
hypotheses to a minimum number, or to use the 
hypothesis of a cylinder with several ellipses to 
generate the hypothesis of a cup. 

Goals for the module can be entered into working 
memory as a three element list: 

(goal <name> <priority>) 

Goals can then have the effect of activating and dis- 
aciivating demons. An example of a goal invoking a 
demon is the rule “cup-demons”. 

(deffrule cup-demons 

"invoice the cylinder finder" 

(phase demons) 

(goal find-cup ?p) 

-> 

(assert (demon cylinder-finder))) 

An example of removal of demons is the rule “removc- 
non-cup-demons”. 

(deffrule remove-non-cup-demons 
"remove cup demons" 

(phase demons) 

(goal find-cup ?p) 

?d <- (demon ?n ?id) 

(not (?n cylinder-finder)) 

=> 

(retract ?d) ) 

Having a rule interpreter provides explicit control 
knowledge for demons and their control logic. It also 


permits the working memory to be used to create and 
free working memory for representing demons state. 
The result is a flexible, easy to use, tool for 
experiments in control of perception. In the following 
section we present an example of such control. 

4 The Visual Navigation 
Demonstrator 

This section illustrates the use of SAVA III by 
presenting an overview of the a visual navigation 
system. This system was constructed for the milestone 
1 demonstration of VAP-II presented in June 1993. 
The structure of the demonstration system is shown in 
figure 6. The system is composed of processes for 

1) Fixation control of the binocular head. 

2) Local navigation actions for a mobile robot. 

3) Image acquisition and prcx:essing. 

4) Tracking and grouping a 2-D description of the 
contents of the image. 

5) Computing and maintaining a 3-D description 
around a fixation point. 

6) Recognition of landmarks and object. 

7) System Supervisor for coordinating processing of 
the other system modules. 

Rxation Control Unit 

The fixation control unit provides a standard interface 
to the device controller for the VAP/SAVA binocular 
stereo head. This module maintains a copy of the 
current state of the fixation point and the component 
axes for the binocular head. It receives commands in 
the form of tasks expressed in either device or motor 
coordinates. Commands arc communicated to the 
binocular head, the robot-arm (neck) or the mobile 
platform using such device-level conyol. 

The fixation control unit also contains facilities for 
programming procedural style “perceptual actions”. 
Such perceptual actions are reflex procedures that 
command the slate of the binocular head at either the 
device level or the motor level based on measurements 
made from images. Examples of low level perceptual 
actions include ocular reflexes for servoing aperture, 
focus and vcrgence. Other examples include procedures 
for tracking a moving object. 

Image Acquisition and Processing 

The image acquisition and processing module handles 
all image processing retiuirements for the other 
modules, thus minimizing the communication 
requirements. This module is based on two computer 
cards constructed by the consortium. The first of these, 
tlie Pyramid card digitizes synchronised stereo images 
and immediately computes a 12 level binomial pyramid 
for the two images. Processing time for each pair of 
images is 40 ms. The second card extracts edge 
segments using Gaussian derivatives. 
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The edge extraction process begins by calculating the 
horizontal and vertical derivatives within the region of 
interest. These derivatives are then combined by tabic 
look-up to compute the gradient magnitude and 
orientation. Points which are extrema in magnitude arc 
marked as potential edge points and compared against 
two thresholds. Hysteresis thresholding is applied so 
that only regions of edge points containing at least one 
point above the threshold are considered. Adjacent edge 
points with a similar orientation arc grouped to form 
line segments. Edge segments arc represented by a 
vector of parameters that includes the mid-point, 
orientation and half-length. 

Three classes of image processing procedures arc 
available in the image processing module 

1) Edge S^egment Extraction . On command, the 
module will transfer the pixels within a region of 
interest to an edge extraction card produced by the 
consortium. This card computes the gradient 
magnitude and orientation and detects pixels 
which are extrema in gradient magnitude. Detected 
pixels arc grouped in single raster scan to 
construct edge segments. Gradient magnitudes tire 
compared to two thresholds to provide a 
hysteresis based thresholding. 


2) Edge Chain Extraction : In place of edge 
segments, another module may request edge 
chains. Edge points are are computed by the same 
algorithm as for edge segments. A one pass 
raster-chaining algorithm is used to construct a 
list of edge chains within the region of interest. 
The edge chaining code is computed on a co- 
processor Ctird based on the Intel 680 

3) Measures [qt Ocular Reflexes . In order to avoid 
communicating images, the measurements on 
which ocular-motor reflexes are based have been 
placed in this module. Measures include coarse to 
fine computation of phase for convergence, and 
gradient based measurements for aperture and 
focus. 


An image description is maintained by a tracking 
process which uses a first order Kalman filter to track 
edge segments. This tracking process improves the 
stability of image primitives, permits the system to 
maintain correspondence of image features over time, 
and provides an estimate of the position and velocity of 
image primitives as well as the uncertainty of these 
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estimates. It also permits information about the 
movement of the head or vehicle to be used to 
compensate for movements by the robot. A vocabulary 
of model access and grouping procedures give 
associative access to the 2D description modules. These 
procedures are used by a library of "demon" procedures 
which can be enabled in order to provide data driven 
interpretation of the image description. 

Separate image description modules exist for the right 
and left cameras. The 2D image descriptions arc 
maintained by a tracking process that uses a first order 
Kalman filter to track image description primitives. 
This tracking process improves the stability of image 
primitives, permits the system to maintain 
correspondence of image features over time, and 
provides an estimate of the position and velocity of 
image primitives as well as the uncertainty of these 
estimates. 

Model access primitives use matching and grouping to 
interrogate the contents of the token model. A set of 
demons may be invoked by other modules to 
interrogate the description after each update using the 
model access primitives. Access to the 2D model is 
provided by a large vocabulary of model access and 
grouping procedures. It is also possible to compose 
sequences of these grouping procedures, extracting, for 
example, all the junctions near an ideal line. These 
procedures may be called by other modules witliin the 
system, or they may be invoked by a set of 
interpretation demons. These demons are placed on an 
agenda by messages from other modules. After each 
update cycle the demon agenda is executed. 

3D Geometric Scene Description Module 

In addition to a description of images, the skeleton 
system maintains a geometric description of the scene. 
This geometric description expresses the structure 
within a region of interest of the scene in terms of 3D 
parametric primitives. This module assumes that the 
phase based convergence reflex maintains the cameras 
converged on an object. Convergence maintains edge 
segments from a region of interest in the scene in the 
similar positions in the image. The image description 
access primitive "FindPrototypeSegment" is used to 
construct a list of possible matching segments in the 
left and right image. This list is sorted based on 
similarity of length, orientation and position. The 
most likely matches arc selected for 3D reconstruction. 

Reconstruction requires camera calibration. A novel 
procedure for dynamic auto-calibration of cameras has 
been developed. This procedure permits a reference 
frame for a pair of stereo cameras to be constructed for 
any scene objects. The projective transformation 
matrices from object centered coordinates can be 
obtained by direct observation (no matrix inversion) 
and can be maintained by a very simply operation. 
These matrices make it possible to reconstruct the 3D 
form of objects in an object centered reference frame. 
As with the image tracking and description module, 
the geometric description is maintained by a tracking 
process in order to provide stability and to maintain 
correspondence over time. 


Symbolic Scene Interpretation 

The symbolic scene interpretation maintains a 
symbolic description of the scene in terms of known 
object categories (or classes) and qualitative relation. 
This description is built up and maintained by 
interrogating the contents of the image and scene 
description modules. The SAVA III symbolic 
description process was implemented using the CLIPS 
rule interpreter system. Rules implement a 
hypothesize and test process which is triggered by 
demons. Working memory of the production system 
serves as a blackboard into which recognition 
procedures can posie their results. 

Process Supervisor 

The process supervisor maintained a list of places and 
routes which the system is to travel, as well as a data 
base of “landmarks** which the system is to find during 
mission execution. The supervisor plans a navigation 
which it then executes by sending commands to the 
other modules. An interesting aspect of the supervisors 
operation was coordinating between the competing 
tasks of watching in front of the robot for obstacles and 
searching for landmarks for position correction. 
Obstacles must be searched at least once every 50 cm, 
while landmark detection is required whenever the 
uncertainty of the estimated position passes a certain 
threshold. Both operations require command of the 
camera head. This balancing act was performed by a 
finite slate automata programmed as a set of rules. 

Navigation Control 

The navigation module controls vehicle actions by 
sending commands to an on-board vehicle control 
program. The on-board program, known as the 
“standard vehicle controller”, provides asynchronous 
independent control of forward translation and rotation. 
The on-board controller acts like auto-pilot, stabilizing 
the vehicle and estimating its position. The controller 
accepts both velocity and displacement commands, and 
can support a diverse variety of navigation techniques. 
The controller is capable of responding to commands at 
any time using a simple serial line protocol. New 
commands for displacement immediately replace 
previous commands. This permits visual servoing to 
be u.sed to pilot tlic vehicle. 

Position and orientation are modelled in the vehicle 
controlled using Kalman filter to maintain an estimated 
position and covariance. The control protocol includes 
a command to correct the estimated position and 
orientation and their uncertainty from external 
perception using Kalman filter update. This command 
has been used to update the estimated position by 
observing the angle to known objects. The LIFIA 
standard vehicle controller is described in greater detail 
in [Crowley-Reignier 93]. The navigation module 
contains procedures to detect and avoid obstacles, and to 
locate and use landmarks for updating the vehicle’s 
estimated position. 
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5. Conclusions 


According to the pfinciple- of “purposivcncss” a vision 
system operates in order to furnish an observation 
function for some task. In order to enrich our task 
domain, we have adapted the VAP Skeleton system to 
serve as the visual component for a mobile robot 
navigating in an indoor environment. We stress that 
the visual navigation is not, in itself, the goal of the 
project. Visual navigation is a task which is 
sufficiently rich in events to explore the problems of 
integration and control of an active perception system. 

During the last four years, the VAP consortium has 
constructed a number of demonstrations of 
continuously operating vision systems. In each of 
these systems explicit control of sensor motion and 
processing has permitted the system to operate in real 
time, with increasingly degrees of robustness. The 
consortium experience has verified the VAP hypothesis 
that control of continuously operating process is basic 
to the design of a genera! purpose real time vision 
system. 
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Abstract 

This paper proposes an architecture for real time vi- 
sion processing on parallel processors with physically 
distributed shared memory, and presents an initial 
implementation of the architecture on i860-based 
Mercury Computing Systems. Within the framework 
of the architecture, each vision function (such as me- 
dian filtering or contour extraction) is defined as a 
task or a set of tasks. A collection of these tasks, 
along with associated data, may be recursively divid- 
ed into subtasks and processed by multiple proces- 
sors through the coordination of a task queue server. 

The task queue sen/er resides in shared memory 
accessible by all the processors. Each idle processor 
subsequently fetches a task and associated data 
from the task queue server for processing and posts 
the result to the shared memory for later use. In this 
way load balancing within the parallel processing sys- 
tem can be achieved without a centralized controller, 
as demonstrated by experimental results. 

1 . Introduction 

It is well known that vision processing involves a tre- 
mendous amount of computation. It is even more so 
for real time vision processing such as vision guided 
grasping of free-floating objects in space [1], in which 
processing cannot be carried out in real time without 
high processing power provided by parallel comput- 
ers such as Hypercubes [2] or i860-based Mercury 
Computing Systems [3]. Even with the availability of 
powerful parallel computers, the real challenge is to 
find the best strategies for mapping data and vision 
tasks onto underlying parallel architectures. 
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A great deal of effort has been directed toward ex- 
ploring parallelism in pixel-level image processing by 
taking advantage of its simplicity and data regularity 
[4]. The success of parallel image processing, how- 
ever, has not been extended to mid-level feature 
processing or high-level image understanding. This 
is due to: 

1 . simple data partitioning methods do not fit to the 
mid-level and high-level vision processing, and 

2. existing Ethernet-based network discourages dy- 
namic data/task migration. 

The advance of VLSI technology in the past decade 
makes it possible to build tightly-coupled parallel 
computers with powerful general-purpose micropro- 
,cessors (such as i860s) interconnected by high 
bandwidth crossbar switches. Furthermore, research 
in physically distributed shared memory [5] has 
reached a stage where the support of physically dis- 
tributed shared memory model on commercial paral- 
lel processing systems becomes a standard, rather 
an exception. The availability of a powerful parallel 
processing system with the support of the physically 
distributed shared memory model has encouraged us 
to study more powerful methods for data and task 
partitioning, task allocation, task scheduling, and load 
balancing, in mid-level feature processing and high- 
level object recognition and pose estimation. The re- 
sult of this effort is the design of our proposed archi- 
tecture for real time vision processing. 

The proposed vision architecture has evolved from a 
vision architecture, known as PARADIGM [6], de- 
signed and implemented on NECTAR (a fiber-optics 
based high-speed network backplane for heteroge- 
neous multi-computers) [7]. PARADIGM was de- 
signed to provide mechanisms and primitives that not 
only allow vision-related parallel programs to be de- 
veloped with ease, but also maximize program con- 
currency at both task-level and subtask-level. While 
developing their programs, users need only focus ef- 
forts on problem solving and task partitioning, without 
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a need to worry about the details of communication 
and task scheduling. PARADIGM is a distributed 
system with centralized control. It is composed of a 
controller and a number of workers which communi- 
cate through message passing. 

However, with the support of physically distributed 
shared memory, it deems unnecessary to have a 
controller for task management (e.g. distribution and 
scheduling), and it would be less efficient for workers 
to "communicate" with each other through message 
passing. Instead of a controller and several commu- 
nication senders, a task queue server is used both for 
"coordinating" task execution and for exchanging 
information. A task queue server is actually a collec- 
tion of various queues residing in a shared memory 
buffer accessible by all the processors/workers. 
Each idle worker subsequently fetches a task and as- 
sociated data from the task-queue server for process- 
ing and posts resuHs to the shared memory buffer for 
later use. In this way, the proposed architecture is 
similar to the blackboard architecture employed in 
Carnegie Mellon’s autonomous land vehicle, Navlab 
[ 8 ]. 

To study the feasibility of using the proposed archi- 
tecture for real time vision processing, the task queue 
server has been implemented on an i860-based 
MC860VS [2]. Parallel algorithms for median filtering 
and contour extraction have also been designed and 
implemented on the MC860VS. Timing statistics has 
been collected and analyzed in order to measure the 
overhead for task management and to refine the pro- 
posed architecture. 

The remainder of this paper is organized as follows. 
Section 2 gives a brief description of the MC860VS. 
Discussed in Section 3 are the criteria considered in 
the design of the proposed architecture. Section 4 
presents the design of, and functionality provided in 
the proposed architecture, which is followed by ex- 
perimental results from an inKial implementation in 
Section 5. Concluding remarks given in Section 6. 

2. Mercury Computing Systems MC860 

A MC860 is i860-based array processor (AP). A 
MC860VS board contains four computing elements 
interconnected by a six-port crossbar switch. Each 
computing element consists of a 40 MHz i860 pro- 
cessor, a high-performance data switch, a DMA 
controller, up and to 16 MBytes DRAM (as shown in 
Figure 1). The MC860 acts as a peripheral to a host 
computer such as a Sparcstation or a 68040-based 
vxWorks platform [3]. 
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Figure 1 : The MC860VS block diagram 

The characteristics of computing elements includes 

1 . Eighty (Sixty) megaflop of computational power 
for single (double) precision operations. 

2. One hundred and sixty megabytes per second 
transfer rate to AP memory (through a high per- 
formance data switch). 

3. A 640 Mbytes/sec internal data transfer rate. 

4. A 4 Kbytes instruction cache and an 8 Kbytes 
data cache. 

5. A shared memory interface between the host and 
the MC860. 

6. A DMA controller that allows for a memory to 
memory transfer rate of 160 Mbytes/sec without 
the involvement of the CPU. 

The software supports for MC860s includes 

1 . An operating system (MC/OS) supports synchro- 
nization features such as semaphores and sock- 
ets, and others such as multi-tasking, various 
types of interrupts, and mapping of external 
memory, anrvsng others. It also supports a phys- 
ically distributed shared memory model for ease 
of parallel programming. 
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2. A Scientific Algorithm Library (SAL) and its ex- 
tended version (ESAL) consist of hundreds of 
micro-coded primitive functions designed to pro- 
vide fast memory throughput (and processing 
speed) by utilizing the data cache (via indirect or 
direct access) as an extended registers. 

3. A rich software development environment in 
which the user can develop an application in ei- 
ther of three ways as follows: 

• The Application Accelerator (Transparent) 
approach. 

• The Subroutine Engine approach. 

• The Multicomputer (Attached Processor) 
approach. 

The Transparent approach allows quick testing. The 
Subroutine Engine approach is suitable for SIMD 
types of parallel processing. The Multicomputer ap- 
proach is feasible for MIMD types of parallel process- 
ing, and is the approach used for the implementation 
of our proposed architecture. 

3. Design Consideration 

This section describes criteria considered in the de- 
sign of the proposed architecture. We first identify 
the characteristics of different vision tasks. Vision 
processing can be roughly divided into three levels: 
low-level image processing, mid-level feature pro- 
cessing, and high-level image understanding. Their 
characteristics are as follows: 

Low-level Image Processing : 

Data items are pixels, which are uniformly distributed 
in the image space. Operations include simple local 
or neighborhoi^ operations (e.g. thresholding, filter- 
ing, and edge detection) performed on a large 
amount of data. The inherent parallelism is fine- 
grained at a pixel level. 

Mid-level Feature Processing : 

Data items are 2D features such as points, lines, and 
regions of which the distribution in the image space 
are not uniform. There are a medium number of data 
items. Relations among data items are spatial rela- 
tionships such as adjacency, overlapping and 
containment. Operations on these data items include 
unary operations for computing geometric properties, 
and binary operations involving spatial relationships. 
The potential parallelism is medium-grained at either 
feature level or at a level of subsets of spatially adja- 
cent features. 


High-level Imaoe Understanding : 

Data items are natural or cultural objects or subparts 
of these objects, which are not uniformly distributed in 
the image space. Operations include matching be- 
tween objects (and their subparts) and possible 
models. The number of data items in general is 
small, but the number of possible models may be 
large. The potential parallelism is at the task level or 
at the level of subsets of the solution space. 

As reported in the literature [4] parallel image pro- 
cessing has been successfully applied to real sys- 
tems such as Warp and Connection Machine. A few 
systems (such as the CMU Navlab system [9]) have 
also been developed by exploiting task-level 
parallelism. However, some tasks are inherently 
time-consuming and may become bottlenecks during 
processing if only task-level parallelism is exploited. 
This problem can be overcome by supporting both 
task-level and sub-task level parallelism, in addition 
to pixel-level parallelism, in our proposed vision 
architecture. Moreover, the heterogeneous nature of 
different tasks/subtasks and the variation in process- 
ing time raise some crucial issues such as task 
allocation/scheduling, data/task migration, and bad 
balancing, that are not encountered in low-level im- 
age processing and scientific computing. The prob- 
lem is further complicated by the need to handle 
spatial features in vision processing (spatial-oriented 
operations, in particular) due to the dimensionality of 
spatial features, their complex data structures and 
tangled topological relationships. 

In the past, most parallel algorithms have been de- 
signed for a single operation at a time. The underly- 
ing assumption was that parallel algorithms for multi- 
ple operations could be obtained by pipelining those 
for single operations. This argument might be true for 
a sequence of local operations (such as fow-level im- 
age processing) where data distribution is more or 
less similar for all the operatbns. However, the ar- 
gument does not hold in general cases where data 
distribution varies with individual operations. In these 
cases, the overhead involved in data redistribution 
must be taken into account, and therefore the best 
algorithm for a single operation may no longer be the 
best choice when it is used along with other 
operations. In other words, an architecture for paral- 
lel processing must allow us to optimize the perfor- 
mance of a set of heterogeneous tasks as a whole, 
rather than the performance of each individual 
operation. 
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To maximize performance, many operations (i.e. 
tasks in the context of the proposed architecture) 
should be executed concurrently as long as no tem- 
poral ordering (i.e. dependency) exists between 
them. The task-level parallelism can be realized us- 
ing a task-queue mechanism (implemented as a Task 
Queue and a set of Subtask Queues in this work). In 
the task-queue mechanism, a task queue and sub- 
task queues are used to keep all the task, that are 
subsequently assigned to (or fetched by) any idle pro- 
cessor/worker. Task dependency can be handled by 
a resource mechanism and will be discussed later. 
To prevent potential bottleneck, any time-consuming 
task must be divided into smaller tasks by either di- 
viding the task into subtasks {task partitioning), or 
dividing the data set into subsets {data partitioning), 
or a combination of both. A mechanism for task par- 
titioning not only allows a vision system to achieve 
good load balance, but also makes it possible for us- 
ers to design a complex parallel/distributed program 
in a hierarchical modular fashion. That is, a program 
can be recursively divided into modules, submodules, 
and primitive operations. 

A straightfonward data partitioning method is to parti- 
tion the data set into many subsets and distribute the 
subsets to each processor using a task queue 
mechanism. However, in the domain of vision and 
spatial oriented processing, partitioning the data re- 
gardless of their spatial relationship usually results in 
the scattering of spatially adjacent data items among 
the processors which increases the overhead in data 
migration for tasks that involve operations on spatially 
adjacent data hems. The overhead may be reduced 
by using shared memory, rather than message pass- 
ing via sockets, for communication. There are other 
issues involved in partitioning spatial-oriented data. 
Readers are referred to [6] for a more detailed 
discussion. 

In summary, the proposed vision archhecture should 
be designed to: 

1 . support both task/subtask-level parallelism, 

2. use task/subtask queues for task management, 

3. use a resource mechanism, along with muhiple 
subtask queues, for task scheduling, 

4. hide the details of communication and task allo- 
catbn/scheduling from users, 

5. employ shared memory for communication, 

6. provide a mechanism for composing parallel vi- 
sion programs. 

A detailed description of the task queue server is giv- 
en in the following section. 


4. Task-Que ue Server 

The proposed real-time vision processing architec- 
ture is similar to the black board architecture. A 
physically distributed shared memory buffer 
“accessible” by all the processors is used for storing 
and fetching tasks (for execution). The block diagram 
of the proposed vision architecture is shown in Figure 
2. K consists of a master, a taskqueue server, and a 
number of workers which communicate with the mas- 
ter and with each other through a physically distribut- 
ed shared memory buffer. The master has a set of 
user defined functions for data/task partitioning, and 
for post processing (such as merging of partial results 
from workers). Each worker is facilttated with a set of 
user defined functions for task execution. 

The heart of the proposed architecture is a task- 
queue server residing in the shared memory buffer. It 
consists of a task queue, an internal task queue, a 
number of subtask queues (one associated with each 
worker/processor), and a reply queue. The task 
queue stores a sequence of tasks to be executed. 
These tasks may be recursively divided into subtasks 
(based on functionality) and queued in the internal 
task queue. For each task in the internal queue, 
there is a corresponding task handler (in the rrKxiule 
operated by the master) responsible for partitioning 
the task into a set of smaller subtasks (via data parti- 
tioning. These subtasks are then “evenly” placed 
into the subtask queues subject to certain con- 
straints, such as the locality of data on which the 
tasks will be executed or resource requirements, (e.g 
I/O devices). The constraints are used to determine 
the executors of the tasks. The reply queue is for 
storing information regarding the completion of sub- 
tasks. 

In the following, we shall give more detailed descrip- 
tion about important functionality provided by the 
proposed architecture, including task partitioning, 
task scheduling and allocation and communication. 

Task Partitioning 

The principal responsibility of each task handler is to 
partition tasks. To maximize concurrency (and hence 
performance), a time-consuming task should be par- 
titbned into a number of smaller subtasks so that the 
load of the task can be distributed equally over the 
workers. There are two techniques to partition a task: 
task partitioning and data partitioning. 

With task partitioning, a task is partitioned into a num- 
ber of smaller tasks which are usually of different 
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Figure 2; The block digram of the proposed vision architecture 


functionality and must be executed in a certain order. 
For example, a task to extract occluding contours 
from a noise image may be divided into subtasks for 
noise reduction, segmentation and contour extraction. 
The task for contour extraction may not be scheduled 
until the other two are completed. 

Instead of partitioning tasks by functionality, data par- 
titioning dwldes the input data into subsets, which are 
then processed by workers concurrently and 
independently. For example, given an image, the 


task of performing median filtering on the image can 
be partitioned by dividing the image into subimages, 
each of which is processed independently by a 
worker. 

To facilitate task and data partitioning, the proposed 
architecture provides several primitives, including 
CreateJTask, Create_Subtask, and Broadcast_Sub- 
task. The function Create_Task creates a new task 
that is placed in the Internal task queue, waKing to be 
scheduled. The function Create_Subtask creates a 
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new subtask. A task in the internal task queue may 
recursively call Create_Subtask to create a set of 
subtasks, that are placed in one or more subtask 
queues, waiting to be fetched by the associated work- 
ers for processing A subtask can be specified as 
either worker-dependent or worker-independent. If a 
subtask is worker-dependent, it must be executed by 
the specified worker. If it is worker-independent, it 
can be executed by any worker, although the speci- 
fied worker is preferred. A subtask can be marked as 
worker-dependent, when other workers do not have 
the resource (including data) needed for handling the 
subtask. 

The function Broadcast_Subtask aWows a subtask to 
be created and broadcast to all the workers. For ex- 
ample, the task Task_Exit is broadcast to all the 
workers at the end of processing for freeing resourc- 
es (including menx)ry, sockets, and semaphores). 

Task Scheduling 

As mentioned earlier, task allocation is carried out by 
using multiple subtask queues to keep spatial locality, 
and task scheduling is achieved by using a resource 
mechanism and a Depend_On call. It is important for 
a distributed system to detect when tasks need com- 
peting resources and to schedule tasks to resolve 
conflicts, in the proposed vision architecture, a re- 
source can be associated with a physical entity (e.g. 
a display device) or a virtual entity (e.g. data struc- 
tures). Resources are created with a capacity and 
tasks can be registered as using a number of 
resources. A task can be executed only when it 
needs no resource or the needed resources are 
available. 

By using the resource mechanism properly, synchro- 
nization between tasks can often be realized. For 
example, producer-consumer tasks can be coordinat- 
ed using the resource mechanism in the following 
manner. First, the information to be produced by the 
producer task is registered as a resource of no 
capacity. When the producer is finished, the capacity 
of the resource is increased. Therefore, the consum- 
er task, which is registered as needing the resource, 
will not be scheduled until the producer task is done. 

In addition to resource conflicts, there are also task 
dependencies between different tasks. Task depen- 
dency is handled by a function, Depend_On (with two 
tasks as parameters), that constrains a task to be 
scheduled only when the other task is finished. The 
function Depend_On, together with the resource 
mechanism, allow task-level synchronization to be 


specified. 

5. Experimental Results 

The initial version of the proposed architecture has 
been implemented on i860-based MC860VS with 
eight i860 processors, each with 16M memory. In the 
initial testing, the taskqueue server physically resides 
on the i860 processor where the master is running. 
Up to seven workers are running on the same num- 
ber of i860 processor. The objectives of the initial 
implementation and testing are as follows: 

1. To learn more about the characteristics (i.e 
strength and limitation) of i860-based MC860VS 
in the context of real time vision processing. 

2. To study the feasibility of using the proposed ar- 
chitecture for real time vision processing. 

3. To measure the overhead involved in using the 
task queue server for parallel processing. The 
amount of overhead will in turn be a guide line for 
determining granularity of parallelism used for 
real-time vision processing. 

4. To study the efforts involved in composing paral- 
lel programs for vision processing using functions 
provided by the proposed architecture. 

To achieve these objectives, parallel algorithms for 
several different types of vision operations have been 
implemented on the MC860VS including algorithms 
for median filtering and for contour extraction. Timing 
statistics for running these parallel algorithms on 
MC860VS have been collected for analysis. 

Median filtering is an pixel-level operation, and is eas- 
ily parallelizable. A typical approach is to divide the 
image (to be filtered) into N subimages. Each of the 
N subimages, along with the median filtering opera- 
tion forms a subtask to be processed by a worker. No 
explicit merging operation is required when ail the 
subtasks are processed. 

On the other hand, contour extraction involves con- 
version of data structures, i.e. the conversion of a 
pixel-level representation (image) into a feature or 
features (contours). One of the approaches to paral- 
lel contour extraction is to divide the image into a 
number of subimages, and to extract a partial contour 
from each subimage. It is not a "regular” operation in 
a sense that processing time on each subimage de- 
pends on the complexity/shape of the contour in the 
subimage. It is not a local operation, either, since an 
explicit merging operation is required to merge the 
partial contours extracted from the subimages to ob- 
tain the complete contour(s). 
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Experiments for collecting timing statistics were re- 
peated for different numbers of i860 processors 
(from one to four), for different numbers of subtasks 
(i.e. 1, 2, 3, 4. 6, 8, 12, 16, and 24, respectively). 

Figure 3 shows timing statistics on parallel median 
fiKering (on 256x256 images). For cases where only 
a single i860 was used, processing time remains rel- 
atively the same regardless of the number of 
subtasks. It implies that the overhead for task man- 
agement is negligible (when the granularity of paral- 
lelism is in the order of tens of milli-seconds) if all the 
subtasks are processed by a single processor. For 
cases where two i860s were used, processing time 
was unusually high (annotated as A2 in Figure 3) 
when the median filtering operation was divided into 
three subtasks. This was due to load imbalance. 
That is, one i860 had one subtask to process while 
the other had two. The same argument is applied to 
unusually high processing time annotated as A3 and 
A4 in Figure 3. It is interesting to point out that pro- 
cessing time at C2 in the figure is slightly higher than 
B2 and D2. This can be explained as follows. The 
size of each image on which median filtering was per- 
formed is 256x256. The numbers of subtasks asso- 
ciated with B2 and D2 are 8 and 16, respectively. 
That is, the image was partitioned into subimages of 
the same size in each of the two cases. Load balance 
was achieved in these cases. On the other hand, the 
number of subtasks associated with C2 is 1 2 by 
which 256 is not dividable. As a result, load balance 
was more difficuH to achieve since some subimages 
had larger sizes than the others and took longer time 
to process. 

Load unbalance can also be observed at B3 and D3. 
The numbers of subtasks in these two cases are 8 
and 16, respectively, which are not dividable by 3, the 
number of processors. 



Figure 3: Timing statistics of running parallel 
median filtering on MC860VS 


A different characteristic in timing statistics on parallel 
contour extraction (as shown in Figure 4) can be 
observed. For example, in the cases where only a 
single i860 was used for contour extraction, process- 
ing time increased with the number of subtasks. The 
increase in processing time is not due to the over- 
head in task management, but due to that in merging 
the partial contours extracted by each subtask to ob- 
tain the complete contour. The overhead is so sig- 
nificant that not much speedup could be obtained by 
increasing the number of processors beyond three. 
For the cases where the number of processors is 
three (or four), the processing time decreases, reach- 
es a minimum, and then increases as the number of 
subtasks increases. This is due to the fact that a 
large number of subtasks (running on a relatively 
small number of processors) will snx>oth out variation 
in (and so decrease) processing time for extracting 
partial contours, but increase time for the merging 
operation. 



Figure 4; Timing statistics of running paralle 
contour extraction on up to four MC860VS 


To investigate the overhead due to task management 
in a multi-processor environment, another experi- 
ment was conducted to measure speedup factors by 
running parallel median filtering on one to seven i860 
processors. In this experiment, the number of sub- 
tasks was set equal to the numbers of processors so 
that load imbalance would not affect speedup 
calculation. The results are shown in Figure 5. The 
solid cun^e indicate the upper bound for speedup. In 
an ideal case where there is absolutely no overhead 
in task management and communication, the pro- 
cessing speed is supposed to increase linearly with 
the number of processors utilized for processing. 
The dash curve shows actual speedup. It can be 
seen that speedup is nearly linear when the number 
of processors is less than four. The performance of 
the system gradually degrade as the number of pro- 
cessors increases. The degradation is probably due 
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to (1) contention aniong processors for accessing 
various queues in the task queue server, and (2) 
overhead in inter-board communication. 



Figure 5: Speedup factors of runing parallel 
median filtering on MC860VS 


6. Concluding Remarks 

The task queue server and parallel algorithms for two 
vision operations have been designed and imple- 
mented on an i860-based MC860VS. Timing statis- 
tics has been collected to analyze the overhead for 
task management (including queue-access control 
and inter-board communication). For local opera- 
tions which do not require merging operations, such 
as median filtering, nearly linear speedup can be ob- 
tained for a small number of processors and proces- 
sor utilization (efficiency) gradually degrades as the 
number of processors increase. For any global op- 
eration which requires an additional operation to 
merge partial results, such as contour extraction, it 
may not be a good idea to divide the operation into a 
large number of subtasks. 

Based on experimental results from initial implemen- 
tation, it is expected that the proposed vision archi- 
tecture will provide a convenient mechanism for 
composing efficient parallel and distributed programs 
(vision programs in particular). However, it should be 
pointed out that a good architecture is necessary but 
not sufficient for achieving real-time vision processing. 
For example, the first target application of the vision 
architecture is vision guided grasping of free flying 
objects in space by the Extravehicular Activity Helper 
and Retriever (EVAHR) [1]. In order to assist real- 
time grasping, EVAHR’s vision module is required to 
provide poses (i.e. the orientations and locations) of 
to-be-grasped-objects to its arm/hand controller at 1 0 
Hz (i.e. 0.1 second per pose estimation) [10]. Median 
filtering and contour extraction are part of prepro- 
cessing before pose estimation can be performed. 
Experimental resuhs seem to indicate that it may take 


more than 0.14 seconds to perform only median fil- 
tering with 7 processors. This problem can be allevi- 
ated by applying median filtering only to subimages 
from which accurate information needs to be 
extracted. In other words, real-time vision processing 
cannot be achieved without efficient (sequential and 
parallel) vision algorithms and a good parallel vision 
architecture (along with powerful parallel processors). 
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At>$tragt 

The ability of a autonomous space robot to grasp a 
freely translating and rotating object is being tested in 
the simulated microgravity environment aboard a KC135 
airplane. The Extravehicular Activity Helper/Retriever’s 
(EVAHR’s) arm trajectory planner continually requires a 
current estimate of the target's translational and rotational 
state. The target’s attitude, angular velocity and angular 
acceleration define its rotational state and the target’s 
translational state include its position, velocity and 
acceleration. Estimators have been developed based on 
the extended Kalman filter (EKF) algorithm. The 
KC135 microgravity environment does not have a 
convenient inertial reference frame for the translational 
dynamics and therefore, the translational as well as the 
rotational object dynamics are described by nonlinear 
equations. The estimator algorithms require intensive 
mathematical computation and therefore, I860 
microprocessors are used so that the software will run in 
real time. Estimator design, implementation concerns 
and issues specific to the KC135 environment are 
discussed. Translational state estimator performance 
results from simulation testing and from real-time 
integrated system testing are presented. 

KC135 Experiment 

One of the objectives of the Automation and 
Robotics Department at the Johnson Space Center is to 
design and develop a free-flying autonomous space robot. 
The immediate goal of the EVAHR project is to have the 
EVAHR grasp a freely translating and rotating object in 
a microgravity environment. This environment is 
simulated in the cabin of a KC135 airplane by having 
the plane fly along a parabolic trajectory. Sections of 
the robot that are necessary for the experiment are bolted 
to the KC135 cabin floor or are secured in some other 
fashion. Included are a Robotics Research arm, an 
inertial measurement unit (IMU), a vision system, 
release mechanism, cages containing the I860 processors 
and other computer equipment. There are two candidate 
vision systems: PRISM3 and the Perceptron scanner 
(2). The PRISM3 position measurement rate is 20-30 
Hz and the position measurement rate of Perceptron 
scanner is 5 - 8 Hz. 

KC135 Environment 


The gravitational forces experienced in the cabin of 

the KC135 change from above 1.5 g (1 g = 32.2 ft/s^) 
during "pull-up" to less than 50 mg of microgravity in 
approximately 7 seconds. The microgravity 
environment O^ss than 100 mg) lasts approximately 
20 seconds; however, the microgravity period during 
which the target is relatively stationary in the Robotics 
Research arm’s work space usually ranges from 0 to 10 
seconds per parabola. This decrease is caused by the 
fluctuations of up to 100 mg in the gravitational 
acceleration during the microgravity portion of the 
parabola. These undesired microgravity fluctuations 
cause the relatively stationary floating target to fly 
against the ceiling or floor of the airplane. 

Another characteristic of the KC135 environment is 
the usual initial fluctuation in the vertical acceleration 
as the airplane enters the microgravity portion of the 
flight. This is illustrated in Fig. 1. To avoid premature 
target release and the resulting undesired target dynamics, 
a release indicator mechanism was developed. The 
vertical acceleration, the pitch rate, and the pitch 
acceleration are monitored to determine the proper time 
to release the target. These measurements are taken 
from accelerometers and 3-axis gyroscopes. The 
microgravity portions of the flight that experience 
minimal pitch acceleration tend to be the better quality 
parabolas. 


Microgravity Vertical Acceleration 



Time in Seconds 


Fig. 1 
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Relative Translational Dynamics 


( 2 ) 


The KC135 cabin environment and the EVAHR 
instrumentation do not provide a practical inertial 
reference frame which can be utilized in modelling the 
target's translational dynamics. The EVAHR 
translational estimator coordinate frame, which is a 
Cartesian coordinate system centered at a comer of the 
IMU plate, is essentially an accelerating reference frame. 
At present, the EVAHR hardware does not include a 
sensor that gives position expressed in an earth reference 
frame (differential GPS etc.). Double integrating the 
output of the accelerometers for 1 - 2 hours without 
another correcting measurement is undesirable. EVAHR 
instrumentation does provide enough information to 
calculate the target *s acceleration relative to the robot. 
There are several implicit assumptions in determining 
the relative acceleration. One is that the EVAHR's and 
target's gravitational accelerations are equal in magnitude. 
It is also assumed that the only force acting on target is 
gravity and third, the atmospheric drag acting on the 
target is zero. The relative acceleration of the target with 
respect to the EVAHR is given by 

Ea^ =-EcOgX ^cOgX x 

-2* ^(OgX (1) 

wh^e 

is the EVAHR's angular velocity (filtered 
gyro reading) 

is the EVAHR's angular acceleration 
is the EVAHR's translational acceleration 
(filtered accelerometer reading) 
is the target's position relative to EVAHR 

is the target's velocity relative to EVAHR 
is the target's acceleration relative to EVAHR. 


c c At p p 

= *^vpastj + Y + *^apastq>) 

= ^rpastj. + Y j + ^vpastj.) (3) 

where 

^apaslrp is the target's relative acceleration 
during the last iteration 
^vpastj. is the target's relative velocity 
during the last iteration 
^rpastp is the target's relative position 
during the last iteration 
At is length of iteration interval. 

(iii) recompute ^a-j- based on updated and 

(iv) recompute (ii) based on updated 

KC135 Translational EKF Filter E)esign 

The EKF system model for the KC135 translational 
state estimator may be expressed as 

^=f(x(t)) +w(t) (4) 

and the measurement model may be expressed as 

Zj^ = H x(lj^) + Vj^ (5) 

wh^ 

x(i) is the Slate vector 

w(t) is a zero mean white process 

Vj^ is a zero mean white sequence. 

For the EKF algorithm, the state may be written as 

x(t) = x*(t) + Ax(t) (6) 


For computational reasons, during each calculation 
time period (iteration) the acceleration is determined 
twice. The first calculation uses the target's position and 
velocity estimates from the previous iteration. Next, the 
position and velocity estimates are updated using the new 
relative acceleration. The acceleration is then again 
determined using the updated position and velocity 
estimates. The relative velocity and relative position are 
determined in each iteration as follows 

(i) compute the relative acceleration, based on 
the target’s velocity and position estimates of the 
previous iteration 


where 

x*(t) is an estimate of the state. 

Combining equations (4) and (6), and expanding to the 
first order, 

+ ( 7 ) 

Therefore, the linearized model maybe expressed as 


(ii) compute 


dAx(t) bf(x(t)) | 
dt bx x(0 = x'*(0 


Ax(t) + w(t) 


( 8 ) 


The state matrix calculation is determined by 

Fit (9) 


whCTC 

- Jx *(*k) = X* (tk) 

At is the time interval between ^k+l 


This calculation is extended out to the second order. 
Accuracy requirements had to be balanced against 
computer computation time requirements in this 
determination. The Kalman filter process noise 
covariance mauix, Q(tjj^i). is computed according to 


X 


p - 3 target position components 
V - 3 target velocity components 
cOg -3 EVAHR gyro errors 

ttg- 3 EVAHR angular acc. errors 
32g- 3 EVAHR accelerometer errors, 


(15) 


The IMU readings and the target's position and 
velocity estimates, all of which have some error, must 
be used to determine the target’s acceleration, which is 
given by equation (1). Now the measured angular 
velocity and the EVAHR angular acceleration can be 
written as 

(0 =©j + 5(0 (16) 


Q(‘k-Fl) = ‘*»dk+l- y H a 


a = Oj + 5a 


(17) 


+ Qdfc) (10) 

whoe 

a is a sparse matrix whose nonzero elements are 
ass(x:iated with the variances of the white Gaussian 
noise of equation (20). 

The Kalman filter error covariance matrix is propagated 
according to 

P(lk+l) = ‘*>(tk+l*lk)P(tk)‘^'^(lk+l-lk) 


where 

0) is the vector of filtered EVAHR gyro readings 
(Oj is the vector of true EVAHR angular velocities 
8o) is the gyro error vector, 
a is the vector of computed EVAHR angular 
accelerations 

otj is the vector of true EVAHR angular accelerations 
8 a is the angular acceleration error vector. 

Similarly, the EVAHR acceleration may be written as 


+ Q(tk+i) (11) 

The state, x(tj^+), is revised by the filter according to 
x(t^+) = x(t^-) + K(y [z(y - Hx(tjj,-) ] (12) 

and the Kalman gain is given by 

K(tk) = P(V)H'^[HP(V)H’r + R]-l (13) 

where 

R is the position measurement error covariance 
matrix. 

Also, the Kalman filter error covariance is updated by 
P(tj^+) = [I - K(gH]P(ti^-) (14) 

The Slate vector specific to the relative translational 
stale estimator is defined as follows: 


^2 ~ ^2i ^®2 (1^) 

where 

^2 is the vector of filtered EVAHR accelerometer 
readings 

is the vector of true EVAHR accelerations 
8 32 is the accelerometer error vector. 

To form the state matrix, the time derivative of the 
target’s velocity error is expressed as 


~^= A8p + B8v + C8(o + D8a - 832 
where 


A = 


2 2 
(Oj +CO2 

-coqCO y~^2, 

-(OQO)2+a2 


-CO j0)Q-i-a2 

2 2 

0)2^+(Oo^ 

-coj(02-aQ 


-(0 2C0Q-aj 

-(O2CO 2 "^1^2 

2 2 
COq +C0j 


(19) 
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0 

2«2 

-2(0 1 

B = 

-20)2 

0 

2(Oo 


2(0j 

-2o)o 

0 


■ ^11 

321 

831 

C = 

^12 

^2 

832 


- ^13 

®23 

— 1 


V is the estimate of the target’s velocity relative to 
the EVAHR 

''true ■ '' 

P j a 3x3 diagonal matrix with the value p j 
P2 is a 3x3 diagonal matrix with the value P2 
iij is the white Gaussian noise vector whose 
value is determined from gyro noise tests 
and from system testing 
02 is the white Gaussian noise vector whose 
value is determined from accelerometer noise 
tests and from system testing 


— ®iPi'®2P2 

321 = -®oP2+2®iPo'2v2 

a3i = 2(02Po-tOoP2+2vi 
ai2 = 2(OoPi-tOiPo+2v2 

322 “ *®^2P2'®0P0 

832 = -(0iP2+2t02Pi-2vQ 
^13 “ 2(OqP2-<02Po‘2vi 
823 = 2(OjP2-0)2Pi'*'2vo 

833 = -0 )qPq-©iPi 


D = 


0 -P2 

P2 0 
L-Pi Po 


Pi 

-Po 

0 


Therefore, the p8rticular EKF system model is given by 




0 

0 

0 

M 

0 

1 


”5p " 

6v 


ABC D -I 


5v 

5(0 

— 

0 0 -Pi 0 0 


5(0 

5a 


0 0 0 -Pi 0 


5a 

_5a2 - 


_0 0 0 0 -p2- 


_5a2- 


0 

0 


+ 


Lo2 


( 20 ) 


Equation (20) has the same form as equation (8). 

Rotational State Estimator 

The rotational state estimator uses quaternions to 
describe the target's attitude and utilizes an EKF 
algorithm (^) . For further information on quaternion 
estimation refer to Bar-Itzhack's work (^). Bierman's U- 
D factorization (^) technique was implemented to test for 
improved performance. An initial difference in 
performance was noted between the conventional EKF 
and the U-D factorization algorithm. However, after 
convergence, there was not a marked difference in 
performance. 

The essential difference between the rotational state 
estimator intended for the earth orbit scenario and the 
KC135 rotational state estimator is the definition of the 
inertial reference frame. For the KC135, the inertial 
reference frame for the target's attitude will be the 
EVAHR's coordinate frame when the microgravity 
portion of each parabola commences (t=0). The 
EVAHR's attitude will be determined by integrating the 
output of the gyros from the commencement time (t=0). 

Results 

The performance results shown in Fig. 2 are the 
product of a simulation test. A program was developed 
that simulates the dynamics of KC135 airplane, the IMU 
sensor readings, and the dynamics of the target. The 
added noise is white Gaussian noise. The Euclidian norm 
error of the position vector estimate and of the position 
vector measurement are shown. 


where 

p is the estimate of the target’s position relative to 
the EVAHR 


SP= Ptrue - P 


747 


Simulation Test Results 



Time in Seconds 

Fig. 2 


Testing of the Perceptron scanner is ongoing and 
calibration of PRISM3 vision system is in process at the 
writing of this paper. Results shown in Fig. 3 are 
preliminary. For this test the target was stationary and 
Fig. 3 shows the target's x-position estimate as well as 
the x-posilion measurement. Once calibration is 
complete, the values of measurement error covariance 
matrix R of equation (14) will be modified. 


Real-Time Test Results 



Time in Seconds 

Fig. 3 

Implementation Issues 

For the arm trajectory planner software to be able to 
receive the target’s state at the rate of 100 Hz, the relative 
translational state estimator had to be divided into a 
repropagation unit and real-time propagation unit. The 


repropagation unit accepts dated position measurement 
inputs, compares the measurement to the appropriate 
archived state, revises the state, and then repropagates the 
state to present time. The updated state is passed to the 
real-time propagation unit. This module replaces its 
target state and Kalman filter parameters with the new 
updated state and new Kalman filter parameters. The 
real-time propagation unit continues to propagate the 
state and appropriate Kalman filter parameters while the 
repropagation unit has accepted another dated position 
measurement and is revising another archived estimate. 
The architecture is shown below in Fig. 4. 

The estimators are implemented on Mercury I860 
processors because of their fast computation capability. 
Closed form solutions were used where possible and 
matrix computation decreased by taking advantage of 
sparse matrices. 

A sensor bias test is performed just prior to the start 
of the flight, and these values are used to correct the 
IMU readings. The expected IMU drift during the flight 
was determined by running drift tests for a length of 
time comparable to flight duration, (1-2 hr). The 
sensor outputs are also passed through hardware and 
software filters before the readings are fed into the 
estimator at the rate of 100 Hz. 

Summary 

The design and performance of the KC135 
translational state estimator were discussed. Also 
presented were issues specific to the KC135 microgravity 
environment. 
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Abstract 


This paper outlines the design of a visual tracking system 
for use on the Extra-Vehicular Activity Helper/Retriever 
(EVAHR) autonomous robot during tests on board 
NASA’s KC-135 Reduced Gravity Laboratory. Issues 
such as the laboratory environment, mission require- 
ments, real-time constraints, and computational loads will 
be examined. 

EVAHR is an autonomous robot designed to perform a 
number of tasks in an on-orbit microgravity environment. 
One of the critical tasks of EVAHR is the ability to grasp 
a freely translating and rotating object. This task is the 
current focus of investigation by the EVAHR develop- 
ment team. To perform this task, EVAHR must analyze 
range image generated by the primary visual sensor to lo- 
cate and focus its sensors on the target so that an accurate 
set of object poses can be determined and a grasp strategy 
planned. This may involve positioning the sensor with its 
gimbal (pan/tilt) system and adjusting sensor parameters 
to provide the perception system with the best possible 
views of the target. The tracker must also provide the in- 
formation that it extracts about the target to pose and state 
estimation modules. These tasks must be performed at a 
frame rate of -9 frames per second. 

1. Introduction 

The EVA Helper/Retriever (EVAHR) is a prototype robot 
currently undergoing development at NASA’s Johnson 
Space Center. The goal of the EVAHR project is to de- 
velop an autonomous robot which can carry out on-orbit 
tasks such as inspection, worksite preparation, Orbital Re- 
placable Unit (ORU) changeout, and crew and equipment 
retrieval (CERS)^ 

The initial task chosen by the EVAHR team for investi- 
gation is the CERS task. The goal for this task is to 


autonomously grasp freely floating and rotating objects. 
Given the situation in which an object (crew member, tool, 
ORU, etc.) becomes unteathered and drifts away from a 
work area, the EVAHR must locate, rendezvous, grapple, 
and return with the object in a reasonable amount of time. 


To accomplish this task in a realistic manner, a system con- 
sisting of a Robotics Research k-807i arm, a dexterous 
manipulator, a Perceptron laser range scanner mounted on 
a high-speed pan/tilt, and an instrument package consisting 
of gyroscopes and accelerometers (an Inertial Measure- 
ment Unit, or IMU) has been assembled, and a series of 
flights aboard NASA’s Reduced Gravity Laboratory (KC-1 
35 aircraft) has been scheduled^. Software for this task 
consists of an object tracker, a pose estimator, a set of state 
estimators (translational and rotational) and an arm/hand 
control module^. 

To prepare for this task, the software modules were tested 
and debugged against an on-orbit simulator^ for testing the 
EVAHR software over an entire rendezvous and grasp. 
The simulator models the environment around a space sta- 
tion and maintains dynamics on the EVAHR and any other 
free-floating objects. The simulator also generates range 
images of the environment with the same general charac- 
teristics as the Perceptron laser range scanner. This simu- 
lation also has the ability to remove some modules and 
substitute perfect data to others, which provides an excel- 
lent testbed for debugging modules and verifying 
algorithms. 

The second phase of investigation consists of a series of 
experiments aboard a KC-1 35 parabolic aircraft. There are 
four sets of flights scheduled. The first flight set consisted 
of a series of flights using the IMU to measure and record 
the aircraft motion during the parabolas. This information 
allowed the EVAHR team to characterize the aircraft mo- 
tions and design the state estimators and the target release 
indicator needed for the following flights. 


Copyright ©1993 by the American Institute of Aeronautics and 
Astronautics, Inc. No cc^yright is asserted in the United States under 
Title 17, U.S. Code. The U.S. Government has a royalty-free license to 
exercise all rights under the copyright clamed herin for Governmental 
purposes. All other rights are reserved by the copyright owner. 


For the second flight, a simplified tracker program was 
flown with the laser scanner and pan/tilt device to evaluate 
the hardware performance and to collect data on object 
motion during the flight. The robot arm and hand were 
also flown but each ran a series of independent tests to ver- 
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other EVAHR Modules 

Figure 1: EVAHR perception modules and data flow 
ify the hardware and software control module performance. 

The third flight consists of flying the entire hardware con- 
flguration in a closed loop with the tracker, translational 
state estimator, and arm control modules, to grasp a ball 
during micro-g. Finally, in the fourth flight, the system 
will grasp a rotating polygonal target with the stage three 
system, combined with the pose estimator and rotational 
state estimator modules. 

Phase I and flights 1 and 2 of phase II have been completed. 
Flight 3 is currently scheduled for February 1994 and flight 
4 is scheduled for late Q1 1994^. 

The primary sensor for the EVAHR is a Perceptron laser 
range scanner mounted on a pan-tilt device. This scanner 
provides 12 bit range images over a 15 meter range. The 
scanner generates an image by sweeping a modulated in- 
frared laser across the field of view using two mirrors, a 
spinning mirror (for each scan line) and a panning mirror 
(to move the scan lines up and down). The depth at each 
pixel is measured by comparing the phase of the returning 
laser signal. 

The scanner has three speeds of operation (the slower the 
speed, the higher the spatial resolution) and a variable ver- 
tical field of view. At it’s highest speed, the scanner 
produces 2.25 frames per second at 256x256 pixels, or 9 
frames per second at 64x256 pixels. The latter is the set- 


Vision Modules 


ting sued in most of the EVAHR experiments. The scanner 
also provides a pixel registered 12 bit intensity image 
along with the range image. 

There are three computational components that currently 
make up the perception system (figure 1). First, an object 
tracker performs a series of image processing tasks, and 
identifies the objects in the range images from frame to 
frame. The second module is the pose estimator^, which 
calculates the object location and orientation (pose). The 
third module consists of the object state estimators^, which 
maintain a real-time state of the object’s rotation and 
translation. These three modules form a cross connected 
loop, in which each module provides information to the 
other modules as information becomes available. 

This paper describes the tracker module of the EVAHR. 
Section 2 describes general design of the tracker module 
and the results of the initial test series in the on-orbit 
simulation. Section 3 describes the stage Il/fligbt 2 tracker 
and the results of that flight. Section 4 describes the stage 
Il/flight 3 tracker, focusing mainly on the lessons learned 
from flight 1. Section 5 discusses the added features re- 
quired for the stage Il/flight 4 tests and section 6 is a 
summary and a discussion of the future plans for the vision 
loop and the tracker module in particular. 

2. System design and on-orbit simulations 
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Figure 2: Physical connections for EVAHR percpetion hardware. 


NASA’s Reduced Gravity Laboratory is a KC-135 aircraft 
which is flown in a parabolic path to produce up to 25 sec- 
onds of freefall. Of this 25 seconds, roughly 9-15 seconds 
have less than 30 milli-g of Z axis accelerations. This is 
the period in which EVAHR must perform it’s grasp. 

To accomplish this task, the tracker module was designed 
as the first stage in the perception system. In the prelimi- 
nary design, the tracker module had five tasks^: 

1. Locate the target in each image. This also involves 
locating and marking the arm in the image. 

2. Send the target position to the translational state 
estimator. 

3 . Send target contour data to the pose estimator. 

4. Maintain an internal object database on target param- 
eters (position, velocity, confidence, etc.). 

5. Calculate the scanner configuration values and gimbal 



Figure 3: Sub-program organization and data flow in 
the tracker module. 


positions to maintain a lock on the target. 

The code to perform these tasks was tested extensively on 
an on-orbit simulator. Although the simulator provided an 
environment in which techniques could be developed and 
data structures could be ironed out, there were several 
shortcomings. First, the data provided from the scanner 
simulator was noise free. This made object segmentation 
trivial as pixels were either object, EVAHR arm, or 
background. Also, control of the scanner parameters and 
gimbal positions was instantaneous and took nothing more 
than a message to the scanner simulator. Finally, the scan- 
ner simulator did not perfectly simulate the Perceptron 
scanner, neither in the geometry of the image that it gener- 
ated, nor in the way in which it could be configured. 

For the flight test tracker module, several additional tasks 
were added: 

6. Command and control of the scanner and gimbal. 

7. Maintain a frame synchronization on the Perceptron 
scanner. 

8. Optionally display or store images periodically for op- 
erator feedback or off-line analysis. 

Each of these tasks, plus the ones listed above, must be 
performed in the 0. 1 1 second cycle provided by the gener- 
ation of the images. 

To accomplish this, the tracker module is split into several 
concurrent sub-programs running on four separate com- 
puters: two 68040-based, vxWorks machines and a Mer- 
cury i860-based machine mounted on a VME backplane, 
and a Sun Sparcstation connected to one of the vxWorks 
machines via ethemet (figures 2 and 3). A host program, 
running on an ’040 spawns the other programs, synchro- 
nizes them, and commands the scanner and gimbal. Also 
on this ’040 are the image move, image display server, and 
image dump sub-programs. The image move sub-program 
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moves images from scanner memory into a local buffer on 
the ’040. The image display server sub-program connects 
to the image display sub-program running on the Spare- 
station via ethemet and sends the locally stored images as 
fast as the bandwidth of the ethemet allows. The image 
dump sub-program takes locally stored image and stores 
them on a backplane-mounted hard disk. 

The second ’040 machine runs a sub-program (endscan) 
which performs two tasks: reset the scanner at the end of 
each image and maintain a data structure on the currently 
available image which records the address of the image in 
scanner memory and the direction of the scan. To maxi- 
mize the speed that it collects images, the Perceptron 
scanner collects images on both the down scan and up scan 
of the mirror (there is no vertical retrace). This means that 
while the even images are collected top to bottom, the odd 
images are coUected bottom to top, meaning that they wiU 
be "upside down” in memory. It is up to the controlling 
program to keep O^ck of the direction of the scan for each 
image collected (there is no indicator of scan direction). 

To collect data, the scanner must be commanded to 1) start 
the panning mirror to begin panning and 2) start collecting 
data*. This is done by setting a STOP ADDRESS for data 
collection, sending a "B" (for bi-directional scanning) over 
the serial port, and setting a FRAME REQUEST flag in the 
scanner status register. When the scanner begins storing 
data it asserts a FRAME BUSY bit in the status register 
and continues to assm it until the commanded number of 
pixels worth of data have been collected. The scanno* also 
records the address of the current 4K block that it is writing 
to in the status register. At the end of the scan the scanner 
de-asserts the FRAME BUSY bit, but does not stop the 
panning mirror’s motion. The endscan sub-program must 
poll the FRAME BUSY bit, and when it is de-asserted, set 
the STOP ADDRESS for the next frame and set the 
FRAME REQUEST flag before the mirrOT has reversed it’s 
direction and started motion again. It typically takes be- 
tween 2.15 and 10.77 milliseconds to perform this task. If 
endscan fails to perform this task, the FRAME REQUEST 
will not be recognized by the scanner and the up- 
scan/downscan synchronization will be lost. 

The main tracker sub-program runs on a Mercury i860. 
There were two reasons for the placing it on this platform. 
The first was the raw speed of the i860-based machine. 
The second was to facilitate communications between the 
tracker and the state and pose estimation modules. On the 
Mercury system, four i860 are connected by a crossbar 
switch, allowing shared memory access at internal memory 
access speeds. 

The first task of the main tracker sub-program is to seg- 
ment and identify each object blob found in each image 


from the laser range scanner. A unique object ID is given to 
each object that is found. As objects move relative to the 
scanner, their blobs move in the image frame. The tracker 
must maintain a proper object ID on each object in the field 
of view for each frame from the laser scanner, and send this 
information, along with 3D blob contour data, to the pose 
estimator. No object recognition is performed at this stage. 
To simplify the problem for the KC-135 experiments, it is 
assumed that a single object will be visible at the start of 
micro-g and that it is the target object. If, by chance, ad- 
ditional objects appear during micro-g, the tracker utilizes 
predicted location and object size to correspond the target 
object. 

3. Stage II/Flight 2: hardware validation and data collection 

The second flight in the planned EVAHR KC-135 experi- 
ments is to test the hardware on board the KC-135 during a 
flight to validate it’s performance and to collect data on 
object motions during the micro-g potions of the flight. A 
simplified version of the tracker program tested in the on- 
orbit simulator was used to try and keep the scanner point- 
ed and focused on the target (white ball) and to store 
images for off-line analysis. A micro-g indicator was also 
flown to inform the operator when to release the target. 

During this flight, a crewmember released the target in 
front of the scanner at the start of micro-g. The simplified 
tracker then segmented any discrete blobs in each of the 
range images generated by the scanner, picked the blob that 
was "most circular", and made sure that the mapper was 
focused and pointed at the target that it located in the 
image. Meanwhile, the image dump sub-program saved 
images to the local hard disk. No information about the 
object was saved or used in subsequent images. 

During preliminary tests aboard the KC-135, it was dis- 
covered that the first few seconds of micro-g are very 
noisy. That is, there are relatively large accelerations ex- 
perienced during this time. To detect when this period is 
over and "clean" micro-g occurs, a release indicator was 
developed. The release indicator analyzes the data from 
the IMU and activates a light when the micro-g has 
stablized®. 

During some of the parabolas, the crewmember released 
the target when signalled by the release indicator while 
during others the crewmember used their own senses to 
estimate the proper time to release the target. In general, if 
the crewmember released the target when signalled by the 
release indicator rather than guessing on their own, there 
was less unwanted motion in the target object. 

It was discovered during early testing that moving the gim- 
bal while the scanner was scanning an image would inter- 
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rupt the scan and disrupt the upscan/downscan 
synchronization. This is due to a safety device which 
shuts down the laser if the spinning mirror speed is not 
within a specified tolerance. To overcome this problem the 
tracker waits for a scan to complete, commands the scanner 
to stop scanning, moves the gimbal, then commands the 
scanner to start scanning again. This is a rather time con- 
suming task, taking on the order of 0.75-1.0 seconds to 
complete. 

To keep from gimballing unnecessarily, the tracker makes 
use of a feature of the scanner in which the vertical field of 
view can be adjusted to start and stop anywhere in the 60 
degree overall vertical field of view. This means that 
when the scanner is set to focus on a target with, say, a 15 
degree vertical field of view, and the target moves toward 
the top or bottom of the image, the scanner can be com- 
manded to adjust the field of view up or down, thus main- 
taining the target in the center of the image. The tracker 
only moves the gimbal when the top or bottom of the ver- 
tical field of view moves outside the 60 degree overall field 
of view, or if the target moves close to the left or right sides 
of the image (a 60 degree field of view at the motor speeds 
we use during the flight). 

During the off-line analysis, it was discovered that the 
tracker actually failed to track during most parabolas. 
There were two reasons for this: 

1 . Segmentation failed when the target moved too close 
to the walls or floor of the aircraft. 

2. The target moved out of the field of view during scan- 
ner reconfiguration or gimbal motion. 

Although data was successfully collected during the pa- 
rabolas that the tracker functioned correctly, nearly all data 
was lost for the rest. For the following test flights, signif- 
icant changes were made. 

Several lessons were learned from flight 1: 

1. Target segmentation using only the range image was 
inadequate. 

2. Control of the scanner and gimbal was too slow. 

3. Focus of the field of view was too tight, 

4. There was no search mechanism for lost targets. 

5. Having a crewmember inside the field of view com- 
plicated both the segmentation and target location. 

6. Waiting until the release indicator signals to release 
the target reduces the amount of unwanted target mo- 
tion dramatically. 

4. Flight 3: Ball grasp 

Flight 3 is the first flight where a grasp of an object is 


attempted. For this flight, it is imperative that the tracker 
maintain a lock on the target throughout the micro-g por- 
tions of the parabola. 

Using what was learned in flight 2, changes were made to 
the tracker software. The elements of the on-orbit simula- 
tion version of the tracker that were left out for flight 2 
were added in. These additions maintain a database of ob- 
jects seen by the tracker and attempts to maintain a corre- 
spondence between objects located in each image, and the 
objects in the internal database. Code was also added to 
located and label the arm in each image^. 

Other changes were made to the tracker to address the is- 
sues raised after flight 2. These changes included: 

1. Making the target white, the background black, and 
segmenting on the reflectance image generated by the 
scanner. 

2. Increasing the speed of the commands to the scanner 
and the gimbal. 

3. Adjusting the mechanism for focusing the field of 
view to be more conservative. 

4. Adding a mechanism for searching for the object if it 
becomes lost. 

5. Using a release mechanism to release the target, rather 
than a crewmember. This was actually planned for 
flight 3 to protect crewmembers from entering the 
workspace of the arm. 

6. Planning to rely on the release indicator to signal the 
target release. 

The most significant change to the tracker module is the 
change from segmenting based on the range image to seg- 
menting from the reflectance image. This requires that the 
interior of the aircraft be covered in black fabric and the 
ball to be painted white. The arm and hand must remain 
their original white/aluminum colors, however, so locating 
the arm/hand in the image still must be performed. 

Another problem that plagued the tracker during flight 2 
was the loss of the target while setting scanner configura- 
tions or moving the gimbal. This problem was solved 
using three techniques. First, the communications between 
the tracker and the scanner and gimbal controller was 
speeded up. The time now needed to change the scanner 
settings is -0.07-0.10 seconds, verses -0.3 seconds during 
flight 2. Also, the time for gimbal motion has been cut 
from -0.75-1.0 seconds, to <0.5 seconds. 

The second technique for attacking the target loss prob- 
lems is to adjust the vertical field of view to it’s maximum 
following a gimbal move. This maximizes the chance that 
the target will be in the field of view. Finally, when a gim- 
bal move is needed, the scanner is moved in both pan and 
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tilt, pointing the scanner at the target. This is opposed to 
the method used in flight 2 in which the gimbal was moved 
only in pan if the target is at the right or left edge of the 
image and moving only in tilt if the target is at the top or 
bottom of the field of view. 

During flight 2 there was no mechanism for recovering lost 
objects. If the target was lost, the tracker moved back to 
it’s home position. To correct this the tracker uses a lay« 
ered approach for reaquiring the target. If no target can be 
located in an image, the tracker commands the scanner to 
open the vertical field of view to it’s widest. If, after the 
scanner change, no target can be found, the tracker com- 
mands the gimbal to point the scanner at the predicted 
location of the target (at the estimated time that the gimbal 
motion should be complete). This predicted location is 
generated either from the translational state estimator or 
the internal tracker estimate of the target position and 
velocity. If there is no target found after these two opera- 
tions, the target is considered ’’lost". At this point the 
scanner and gimbal are reset to their starting positions and 
the tracker data structures are cleaned up and reset. 

Although originally scheduled for Q4 1993, flight 2 has 
been postponed until Q1 1994 due to hardware and KC- 
135 scheduling problems. 

5. Right 4: Polygonal object grasp 

For flight 4, the tracker software will remain generally the 
same, with the addition of the communications with the 
pose estimator module. This flight is currently scheduled 
for late Q1 1994. 

When the pose estimator is ready for data, the tracker sends 
a contour of the target object, labeled with both depth in- 
formation and which portions of the contour belong to the 
target (in the case of occlusion). Along with this informa- 
tion extracted from the image, the tracks passes along the 
state of the scanner and gimbal, and the estimated pose of 
the target (if it is available). 

There are two competing issues that the tracker must deal 
with during the flight 4 object grasp: 

1. The pose estimator takes a relatively long period of 
time to calculate the pose of an object without any pri- 
or knowledge of the target pose. 

2. The rotational state estimator must have data very fast 
during it’s initial start-up period in order to converge 
to a solution in a reasonable time. 

To overcome these issues, the tracker queues up the first 
three images worth of data to pass to the pose estimator, 
whenever the pose estimator is free. If the rotational state 


estimator gets bad data from the state estimator, or if it 
cannot converge with the three images worth of data that it 
receives, it tells the tracker to re-initialize. The tracker 
then queues another three images worth of data and the 
process is repeated. This task is performed as part of the 
main tracker image processing cycle. 

6. Summary 

The tracker module performs the first stage of image pro- 
cessing as part of a general perception system for EVAHR. 
In addition to locating a target object in a series of range 
image, the tracker must transmit the position of the target 
to the translational state estimator, transmit range data 
from the target contour to the pose estimator, and correctly 
configure the scanner and point the gimbal so that the tar- 
get stays inside the image. These tasks must me performed 
within the real-time constrains of the scanner frame rate 
and the environment inside NASA’s Reduced Gravity Lab- 
oratory KC-135. 

Of four sets of flights scheduled aboard the KC-135, two 
have been completed as of the writing of this paper. The 
first set of flights measured the motion of the aircraft with 
an inertial measurement unit (IMU). The second set of 
flights involved flying the laser scanner, the aim, and the 
hand in a series of independent tests. Also during these 
flights the tracker system stored images of a floating target 
to a hard disk for off-line analysis. 

The third set of flights will involve using the perception 
system, along with the arm and hand systems, to grasp a 
freely floating ball. The fourth and final set of flights will 
incorporate a pose estimator and rotational state estimator 
to grasp a polygonal target. 

The tracker module plays an important role in flights 2, 3, 
and 4. For flight 2, the tracker was tasked to maintain the 
scanner pointed toward a ball target throughout the micro- 
g portions of the flight. Although the tracker largely failed 
in it’s task, important lessons were learned. 

For flights 3 and 4, the tracker will maintain a history of 
each object that it detects in it’s images. It uses this infor- 
mation to maintain an object correspondence between 
frames. The tracker also has a method for searching for the 
target if by chance the target moves outside the field of 
view of the scanner. This allows the tracker to recover 
from unexpected events such as an unexpected aircraft 
motion. 
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AB.STRACT 

The EVAHR (Extravehicular Activity Helper/Retriever) 
robot is being developed to perform a variety of 
navigation and manipulation tasks under astronaut 
supervision. The EVAHR is equipped with a manipulator 
and dexterous end-effector for capture and a laser range 
imager with pan/tilt for target perception. Perception 
software has been developed to perform target pose 
estimation, tracking, and motion estimation for rigid, 
freely rotating, polyhedral objects. Manipulator grasp 
planning and trajectory control software has also been 
develq)ed to grasp targets while avoiding collisions. 

Hight experiments have been scheduled aboard NASA's 
Reduced Gravity Laboratory (KC-135 aircraft) in 1994 to 
attempt grasps of free-floating objects. A software 
simulation of the EVAHR hardware, KC-135 flight 
dynamics, collision detection, and grasp impact dynamics 
has been developed to integrate and test the EVAHR 
software. This paper describes the EVAHR system, with 
emphasis on the robotic and KC-135 simulation software. 

1. INTRODUCTION 

Much woilc has been done on autonomous grasping of 
stationary, complex objects, e.g. in bin-picking and pick- 
and-place problems! 1,2]. However, very little work has 
been devoted to autonomous grasping of moving, 
complex objects. This is probably due in part to the 
difficulty of the problem and to a lack of applications for 
such technology. There exists an abundance of 
applications for tihis technology at NASA, however, as 
recent Shuttle-based satellite recovery attempts have 
amply demonstrated. STS-49 required 3 astronauts to 
grasp the ailing INTELSAT VI satellite - an autonomous 
mechanism would have spared the astronauts hazardous 
EVA activity. The difficulties encountered on 41-C in 
1984 rescuing the Solar Max satellite and the failure to 
recover the Leasat 3 during 5 ID in 1985 provide further 
evidence of such a need. The recent ROTEX experiment 
onboard the Shuttle mission STS-55[3] demonstrates that 
the German Aerospace Research Establishment (DLR) 
considers this a key piece of robotic technology for space. 
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The ROTEX system combined a tele-operated manipulator 
and camera system to grasp a free-floating object, among 
other tasks. 

The EVAHR (Extra-Vehicular Activity Helper/Rehiever) 
project has been developing robotic technology, hardware 
and software, for navigating among and grasping free- 
floating objects since 1987. The EVAHR project was 
originally conceived as an autonomous device for 
retrieving personnel or objects which had become detached 
from Space Station Freedom (SSF). At the conclusion of 
Phase II in 1990, the following autonomous robotic 
capabilities were demonstrated on a precision air bearing 
floor (PABF): a) navigation among stationary obstacles 
and b) grasping arbitrarily located and oriented, cylindrical 
targets. At this time, a simulation of the EVAHR in an 6 
degree-of-freedom (DOF) orbital environment[4] was 
developed for use in the next phase. It would provide a 
testbed for development of 6 DOF navigation and grasping 
software. Models initially developed included orbital 
dynamics, plume impingement effects of the Manned 
Maneuvering Unit (MMU) (a compressed-gas-based 
propulsion system), and a laser scanner range/intensity 
image simulation[5]. Six DOF body motion control and 
orbital state estimation algorithms were developed and 
tested for navigation purposes. Phase III then took as its 
primary goal the development and demonstration of an 
integrated system for grasping free-floating, complex 
objects. 

The EVAHR team chose the KC-135 zero-gravity aircraft 
as the hardware testing and demonstration platform for this 
phase[6]. The KC-135 aircraft flies a series of parabolas, 
free-falling over the hump of each parabola, simulating 
zero-gravity or weightlessness for objects and personnel 
free to float inside the plane. Each of these periods lasts 
20-25 seconds. The EVAHR, attached to the floor of the 
KC-135 throughout the experiments, will attempt to 
grasp targets released in its envelope during the simulated 
zero-gravity period . While the KC-135 is a difficult 
environment for such testing, it provides the only direct 
means of testing the contact dynamics of free-space 
grasping short of a Shuttle flight experiment. 

Prior to flight, software to grasp free-floating, complex 
objects in a 6 DOF, orbital environment was developed 
and integrated. Software for an orbital environment was 
developed first and testing was conducted to determine the 
performance of the system[7]. Dynamics and state 
estimation software was then developed to represent the 
KC-135 environment. 
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2. QBJECTIVRS 

Essentially, the integrated EVAHR grasping software 
must be capable of autonomously grasping a 
variety of polyhedral objects, each in less than 
15 seconds, where the object is freely but 
slowly translating and freely rotating at up to 
30 deg/sec. 

A maximum time-to-grasp of 15 seconds was initially 
chosen as a conservative estimate of the zero-gravity 
period provided by the KC-135. While recent data 
gathering flights on-board the KC-135 have shown that 
the time available is actually less than 8 seconds, most of 
the testing presented here was conducted with the 15 
second per grasp attempt time limit One might speculate 
that in order to be considered useful in real applications, 
the robot would have to perform these grasps much more 
quickly, but this consideration did not affect our 
requirements. 

The maximum rotational rate is taken from a Crew and 
Equipment Retrieval Study (CERS) conducted by 
NASA[8], which estimated the likely maximum 
separation rates of objects which might become detached 
or untethered from the SSF. The CERS maximum 
translational rate of 2 feet/sec was ignored because the 
EVAHR, as a free-flying vehicle, would rendezvous with 
and stationkeep at the target with nearly the same 
velocity. From on-orbit navigation tests in the EVAHR 
simulation, a rate of 2 in/sec during stationkeeping was 
found to be the maximum. However, it would be 
impractical for the EVAHR vehicle to match the CERS 
rotational rates. 

To enable the EVAHR to meet these objectives, a 
metrically-accurate CAD surface model and mass 
properties (mass, center-of-mass, inertia matrix) are 
provided to the EVAHR describing each target object prior 
to runtime. The target's surface must be non-specular so 
that the laser scanner can image it. 

Integrated software testing with the orbital simulation 
yielded information about the performance of the EVAHR 
software in its ultimate target environment, i.e. the 
success ratio for different target velocities, the required 
vision update rates, etc. The addition of a KC-135 
dynamics model provided a means of integrating the KC- 
135 state estimation module and verifying continued, 
successful system performance. 

3. EVAHR HARDWARE 

The principal EVAHR hardware elements to be flown on 
the KC-135 flight experiments is shown in Figures 1 and 
2. These elements are combined into a fictitious EVAHR 
body in the orbital simulation shown in Figure 3. A 
Perceptron LASAR laser scanner is mounted in a pan/tilt 
mechanism on top of the EVAHR body. The Perceptron 
provides 64x256 range/reflectance images at 9 Hz, 


128x256 resolution at 5 Hz, or 256x256 at 2.5 Hz - 
switchable at runtime. The range resolution is 
approximately 1/7 inch, though noise can result in a range 
error of up to 3%. The pan/tilt motors are able to rotate 
the Perceptron at 180 deg/sec. A Robotics Research (RR) 
K807i 7 DOF manipulator is mounted as a left arm, with 
a 3-fingered JH4 dexterous hand as its end-effector. The 
RR arm has a maximum end-effector speed of 30 in/sec, 
static repeatability of .002 in, and may accurately support 
loads of 10 lbs. The dexterous hand was manufactured at 
JSC. It provides 3 joints per finger, though only 2 are 
active. The maximum finger joint speed is roughly 120 
deg/sec so that the hand is able to close in roughly 1/4 
sec. Proximity detectors are located on the palm and each 
finger tip. 



Figure 1. EVAHR laser scanner & pan/tilt 



Figure 2. EVAHR manipulator/hand 
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When the EVAHR is flown on the KC-135, the EVAHR 
hardware will be fixed to the floor of the aircraft - only the 
target will float free. In the orbital configuration, a 24- 
thruster Manned Maneuvering Unit (MMU) [9] is strapped 
to its back to provide 6 DOF propulsion. The MMU 
model was turned off during the grasping tests. Not shown 
in the figures is an inertial measurement unit (MU) 
composed of 3-axis accelerometers and rate gyros, which 
provide acceleration and rotational velocity measurements 
of the EVAHR body. The EVAHR will use the following 
computers to grasp free-floating targets in real-time 
onboard the KC-135: 8 Intel i860 processors connected via 
crossbar, 14 Transputer T800 processors networked via 
synchronous Transputer links, and 4 68040 processors. A 
VMEbus will connect the processors/ networks and 
sensor/actuator electronics. 



Figures 3. EVAHR grasping ORU in orbital simulation. 


4. EVAHR SIMULATION 

Figure 3 shows testing with the orbital simulation. The 
arm and laser scanner will be attached to the floor of the 
KC-135 in flight, as in Figures 1 and 2, rather than to the 
body. The KC-135 simulation includes models of the a) 
EVAHR hardware and b) dynamics of the KC-135 
environment. Models of the EVAHR hardware required to 
test autonomous vehicle navigation were developed 
initially. One of these models, the laser scanner image 
simulation software, continues to play a major role in 
vision-guided manipulation. This simulation takes 
advantage of the z-buffering graphics hardware associated 
with Silicon Graphics workstations to provide fast range 
image generation for polyhedral-surface objects. The 
model was extended to provide configurations possible in 
the Perceptron laser scanner - 64x256 or 256x256 images 
and variable field of view. The laser scanner simulation 
model does not try to model noise or distortions, however, 
other than uniform pixel noise. Preliminary tests with the 
actual Perceptron hardware show a maximum error of 3% 


in the range pixel measurements. Unfortunately, the 
Perceptron produces more systematic noise which is not 
modeled by this simulation. Target CAD models (see 
Figure 6) along with CAD models of the EVAHR 
hardware (see Figure 3), are used by the laser scanner 
simulation to generate simulated images. 

Additional models were developed expressly to support 
testing of the grasping software. These include models of: 
a) the RR manipulator and JH4 dexterous hand, b) 
collision detection, and c) grasp contact dynamics. Because 
dynamic models of manipulators are hard to obtain and 
compute-intensive, we opted for an acceleration-level 
model of the RR 7 DOF arm and JH-4 dexterous hand. 
This model has since been proven in tests with the actual 
arm. Collision detection software was developed to detect 
a) desired collisions between the JH-4 palm/fingers and the 
target and b) collisions that the manipulator trajectory 
control software is attempting to avoid. The collision 
detection software requires that the objects be convex or be 
composed of convex subparts. The dynamics of each body 
(e.g. EVAHR, target) is computed by sununing the forces 
and moments on each orbiting body, computing the 
associated translational and rotational accelerations, then 
integrating (at 100 Hz) to arrive at each body's velocity 
and position. One such translational force is gravity. 
When collisions between the fingers or palm of the JH4 
hand and the target are detected, additional forces and 
moments on the target are computed and "folded into" the 
(orbital) dynamics state propagation. To make the 
computation tractable, each finger is assumed massless 
relative to the target and stops on contact: however, its 
motion is resumed if the forces/moments on the target are 
such that the target moves away from the finger. The 
forces/moments are computed one contact point at a time, 
even when multiple fingers are in contact simultaneously. 
Assuming a loss of velocity after each contact due to 
friction, the motion of the target is considered "stopped" 
when the target's total velocity falls under a threshold. 
This simulation has not been validated except by visual 
verification of the results of random test cases. Numerous 
cases have shown the target slip out of the grasp of the 
closing fingers as well as the target being trapped by them 
- the behavior has speared "reasonable". 

5. EVAHR .SOFTWARE 

The EVAHR grasp software acts as a closed-loop control 
system (see Figure 4), where the simulation mMels just 
described act as the plant. The vision and grasping 
modules are as described in 7. All are implemented in C, 
as are the simulation models. 

Measurements of the target's pose (3D position/3D 
orientation) are computed by the vision modules 
Tracking[10] and Pose Estimation[ll, 12]. The Tracker 
processes each image to find a rough estimate of the 
target's location and, if the pose estimator is ready, the 
contour of the object. To avoid confusing the target with 
the arm/hand, which may be in the image or even partially 
obscuring the target, the arm/hand is removed from each 
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image. The is accomplished by using a CAD model of the 
ann/hand. Pose estimates are computed by finding the best 
match of image features (edges/vertices) on the occluding 
contour to corresponding features in CAD model 
representations of each target (see Figure 6). The pose 
estimation software uses the same CAD models as the 
laser scanner simulation uses to draw its images and thus 
benefits from perfect surface geometry models of the 
targets. These measurements are computed at rates which 
vary depending upon a number of factors: the 
complexity/symmetry of the object, the availability of 
predicted poses for the image time, the degree of occlusion 
by the EVAHR arm/hand, and of course the computing 
resources available. The output rate of the pose estimator 
may vary from .1 sec (laser scanner minimum image 
interval) to several seconds. 

These measurements are provided to independent Kalman 
filters which estimate the target's translational and 
rotational states. These model-based state estimators are 
different from those in the orbital EVAHR software, due 
to the different environment dynamics, and are described 
more fully in [13]. The rotational filter required only 
minor modifications, namely redefinition of the inerti^ 
coordinate system (CS). While the orbital version used an 
Earth-centered CS, the KC-135 version uses the aircraft's 
orientation at the beginning of each parabola. Thus the 
target's attitude is EVAHR-relative from that point on, 
due to the arm being fixed to the aircraft. The KC-135 
translational state estimator had to be made explicitly 
EVAHR-relative, due to the lack of the Earth-based 
position correction (e.g. Global Positioning System 
(GPS)) that was assumed in the orbital software. The 
target's state is thus estimated in the EVAHR's coordinate 
system, using inertial measurements from the IMU and 
initializations/corrections from vision. For the purposes 
of this round of simulation testing, both the 
measurements and corrections were taken, noiseless, 
directly from the dynamics simulation. Both the 
translational state (position, velocity, acceleration) and 
rotational state(attitude, rotational velocity and 
acceleration) states are computed at 100 Hz for the 
EVAHR Manipulator Trajectory Controller. They are also 
archived in the World Model database as predictive 
feedback to the Tracking/Pose Estimation modules for 
images to be processed shortly thereafter. 

The Manipulator Grasp Planning and Trajectory Control 
module[14] receives an estimated target state at 100 Hz. 
At 10 Hz, the Grasp Planner decides which of the multiple 
grasp regions available for the target is optimal for 
grasping. (Prior to runtime, the graspable regions on the 
target's surface were computed and placed in a database. 
These regions were found automatically based on CAD 
models of the JH4 hand and the target.) The Manipulator 
Trajectory Controller (MTC) computes RR/JH4 joint 
accelerations to track the motion of the chosen grasp 
region on the target at 100 Hz. Each cycle the MTC also 
computes joint accelerations to avoid a) joint 
singularities, b) joint limits, c) RR link-link collisions, 
d) RR-EVAHR body collisions, and e) collisions between 
the RR/JH4 and regions on the target which are not to be 


grasped. The MTC uses a potential field-based algorithm 
to compute these avoidance accelerations. 


EVAHR So ftw ais 

archived est tgt pos,att 


World 

Model 


Pose 

Estimation 


target l 
contour 
info 

(<10Hz) I 
Tracking 

- segmentation 

- correspondence! 


target pos, att I 
(1-lOHz) 


objs' pos| 
(lOHz) 


tilt 

angles 



est. tgt 
state 
(100 Hz) 


; 




P 

L_ 


range, ^ 

set 

refl 


FOV, 

images 

#lines 

(2.5-9HZ) 



mT---* 


Translational, 
Rotational 
State Estimation 


Manipulator 
Grasp Planning, 
Trajectory 
Control 


T 


desired 

arm/hand 


EVAHR joint accel 
accel, rotvel (100Hz)| 
(lOOHz) 


arnVhand 
joint 
states 
(100 Hz) 


Pan/ 

TUt 


Laser 

Scanner 


Measurmen 
Unit (IMU)i 

ZIZ 


objs’ trans, 
rot states 


Dynamics 

- orbital, KC135I 

- arm/hand F/M 

on target 
Collision det. 
Impact 


Arm/hand 
(7/10 DOF) 


arm/hand joint angles 


Simulation Software 


Figure 4. EVAHR/Simulation Software Architecture 


;;0.2 ^Translational state estimator initialized | 


3.6, 

3.7 

4.3* 


: Rotational state estimator initialized 

* GrasD command issued 

oPose estimator feedback initiated 


;10.4*^>3 JH4 fingers contact target 
; 1 1 .9 Successful run complete 


1 15.1 
time 
(secs) I 


Maximum time allowed 


Figure 5. Avg event timeline for NBox. 


760 



The MTC takes advantage of the extra DOF in the 
manipulator to minimize the effects on the end-effector 
trajectory. 

The cycle rate of 100 Hz was determined primarily by the 
need to control RR joint accelerations at that rate. It was 
also determined previously that propagation of the target's 
rotational state would diverge if the integration timestep 
were less than approximately SO Hz. 

The software was run on 2 Sparcstations and 1 Silicon 
Graphics 70GT for the experimental testing. Figure 5 
shows a representative sample event timeline for an 
attempted grasp of the NBox. 

Two different types of target shapes were used in testing: 
spherical and polyhedral. In the case of the sphere, the 
pose and rotational state estimation modules are 
unnecessary and grasping is greatly simplified as well. 
Therefore, polyhedral objects were used almost exclusively 
for testing. Three different polyhedral sh^s were used 
(Figure 6). 


Jettison handle (JettH) Orbital replacement unit (ORU) 


NBox Sphere 

Figure 6. Polyhedral and spherical target shtqies 

An additional major difference between the oibital and KC- 
135 software lies in the area of image segmentation. 

While the background of space provides an extremely 
convenient basis for segmenting the target in each image, 
the inside of the KC-13S aircraft is very difficult and 
compute-intensive. To minimiz e the computational load, 
a bl^k curtain will cover the inside of the aircraft to 
simulate space. 

7. CONCLUSION 

The EVAHR software for grasping a target floating free 
on-orbit has been described, as well as the KC-135 


simulation for testing the software as an integrated whole. 
The software was integrated and successful grasps were 
achieved, similar to those obtained in thorough, integrated 
testing with the orbital simulation. This result suggests 
that the software is ready for flight testing on-board the 
KC-135. 

8 . FUTURE WORK 

A preliminary KC-135 flight test using spherical targets 
exclusively will be ctmducted in the near future. The 
advantage of spherical targets is that they have no 
rotational state and, hence, neither the pose estimation 
software nor the rotational Kalman filter are required. A 
Teleos PRISM3 stereo-vision system, instead of the 
Perception laser scanner, will be used to provide target 
position inputs (at > 20 Hz). The translational state 
estimation and grasping software modules have been 
reimplemented on parallel Intel i860 and Siemens 
Transputer architectures to achieve real-time speeds. 
Calibration of the arm and PRISM3 systems has been 
completed and pre-flight integrated systems testing is to 
begin shortly. 

Meanwhile, modifications are being made by Perceptron 
to the laser scanner to improve its noise characteristics. Of 
primary concern is range drift, which is serious enough to 
prevent the sensor from being calibrated. Assuming that 
the problems with the laser scanner are corrected, a flight 
with the polyhedral targets is scheduled for sometime in 
the summer of 1994. 
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Abstract 

Task directed, active vision techniques are 
beginning to produce vision systems capable of 
providing real-time depth perception for robots. 
The concepts of shallow disparity filters and 
coarse disparity segmentation show particular 
promise for several reasons; they require 
minimal correlation search, thus they are fast; 
they employ redundant measurements, so they 
are noise tolerant; and they are not dependent 
on precise measurements, so they are 
calibration insensitive. We have extended these 
techniques by coarsely segmenting regions 
about the fixation point according to the task at 
hand. Relative occupancy of these regions 
provide cues to the approximate location of 
relevant features. They also provide reliable 
information for maintaining gaze stabilization. 
We have implemented these techniques on a 
real-time active stereo vision system to produce 
robust visual skills for tracking highly dynamic 
objects and occluding contours. These 
techniques are general in that they can be 
applied to a wide variety of objects. 

Introduction 

Vision has long been recognized as one of the 
most significant pieces of the autononwus robot 
control puzzle. The richness of information 
available in the visible spectrum makes vision 
unique from other sensing modalities [Mataric]. 
The vision research community initially 
endeavored, unsuccessfully, to assimilate all 
available data which resulted in computational 
overload. Much of the recent success in vision 
can be attributed to a transformation in thinking 
which has swept the vision community. Stereo 
vision, in particular, has seen a resurgence as 
the role of attentive mechanisms, foveation, and 
vergence in biological models has become better 
understood. Exciting developments in gaze 
control and task directed vision are now bringing 
long awaited goals within our grasp. 


Classical Stereo 

Stereo vision provides a means for depth 
determination through the principal of 
triangulation. If an object in a scene is located in 
the center of a stereo image pair, then that 
stereo system is said to be verged, or fixated, on 
the object. 

In the more general case, features in a scene 
will be viewed from a variety of depths. 
Assuming that there is some mechanism for 
matching (correlating) features between the 
stereo image pair, the feature disparity (the 
relative number of pixels from the optical axis) 
for each image is required to determine the 
object location. In addition to the baseline, the 
angle subtended by each pixel must be known 
for the stereo cameras. The accuracy of these 
measurements is dependent on the accuracy 
with which calibrated system parameters are 
known. 

Feature matching is a non-trivial problem in 
stereo vision. Extensive research has resulted in 
a number of schemes ranging from simple pixel- 
pixel correspondence or vertical line matching 
[Braunegg 90], to more sophisticated 
approaches which employ filtering of the images 
prior to matching in order to normalize the scene 
and accentuate its salient features [Marr-Poggio 
79). Still other schemes utilize a variety of 
primitives [Marapane-Trivedi 92] in order to 
determine correspondence. The approach taken 
to achieve correspondence is often driven by the 
characteristics of the domain in which the vision 
system is to be operated. 

Stereo Methodology 

When given the opportunity to acquire stereo 
and other forms of vision data, it is tempting to 
try to generate a complete 3-d map 
(reconstruction) of the environment. Early 
attempts at reconstruction consistently met with 
insurmountable difficulty due to the complexity of 
the real world (and its lighting conditions). Real- 
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time performance was only conceivable in 
extremely limited domains. 

When these vision systems were applied to 
robotic tasks such as navigation, the lack of real- 
time performance became most evident 
[Moravec 83J. Even if an accurate reconstruction 
were achievable in real-time, the task of "making 
sense" of the resultant geometry is 
computationally expensive. 

To complicate matters, 3-D matching algorithms 
require precise depth information in order to 
converge on a correct solution. Precise data 
requires precise calibration which is not only 
time consuming to achieve, but difficult to 
maintain [Grimson 93] when sitting atop a 
dynamic robot. Finally, biological models of 
successful stereo vision systems do not suggest 
a dependence on highly accurate "calibration". 

Biological models have provided some insights 
to the vision community which is turning away 
from reconstruction and towards approaches 
which can be managed in real-time. The 
prevailing attitudes held by the vision community 
include; 

• The desire to make computation more 
manageable by limiting computation to what is 
most relevant to achieving the task at hand 
[Ballard 91]. 

• The desire to exploit regularity in the 
environment to allow for reduced complexity 
(Successfully demonstrated) [Horswill 93], 

• The need to tightly couple sensor control with 
what is being perceived (active vision). 

The last concept is common in the stereo vision 
community which is particularly concerned with 
fixation [Ballard 91] and gaze (vergence) 
[Coombs-Brown 91] control. 

One of the most difficult problems in stereo 
vision is matching; however, recent techniques 
for reducing the computation associated with 
correlation have met with success [Grimson 93]. 
By reducing the range over which a match is 
likely to occur, the search space and the false 
match rate are reduced [Olson 93]. This 
approach may be extended in order that shallow 
disparity regions can be used for local 
segmentation in depth [Grimson 93]. Such 
approaches have only begun to be explored but 
show much promise. 


In support of robot activities it is necessary for 
vision systems to provide relevant information in 
real-time. Even when techniques for 
computational simplification have been 
employed, a large amount of data must be 
processed in order to be useful to a robot. An 
active, real-time stereo system is still a "complex 
beast". Few institutions [Nishihara 90] 
[Marapane-Trivedi 92] have the resources 
necessary to assembie and program the 
specialized hardware required for real time 
stereo vision. However, it would be more difficult 
to simulate the interaction between a noisy, 
dynamic world and an active stereo vision 
system than to build the real thing. These are the 
primary reasons for the paucity of real-time 
stereo vision research. This will improve as 
computing hardware and digital cameras 
become faster and more accessible. 


PRISM Stereo Vision 

The following discussion concerns our work in 
active stereo vision. The objective of our R&D 
efforts is to provide needed visual perception 
skills to the mobile and articulated robotic 
systems we support. This stereo vision project, 
which has been ongoing for approximately one 
year, employs a PRISM vision system which will 
be described briefly. 

The PRISM [Nishihara 84] [Nishihara 90-B] 
stereo vision system is the embodiment of Keith 
Nishihara's biologically inspired Sign Correlation 
Theory [Nishihara 90-A]. Nishihara's theory is an 
extension of Marr and Poggio's Zero-Crossing 
Theory [Marr-Poggio 79] . The Zero-Crossing 
Theory was found to be effective at extracting 
the salient features of an image in a normalized 
fashion, but the matching algorithm did not 
perform weli in the presence of noise. Sign 
Correlation is the dual to Zero-Crossing contour 
correlation, yet it is highly tolerant to noise and 
preserves the intent of Zero-Crossing Theory 
(Nishihara 90-B). 

The PRISM system employs dedicated hardware 
to provide spatial and/or temporal disparities in 
real-time. It is comprised of two frame- 
synchronized cameras mounted on a servo 
controlled pan-tilt-verge head. The camera 
output is digitized and fed into a custom 
Laplacian of Gaussian (LOG) convolver board. 
The LOG convolved stereo output is then 
buffered on a high speed, parallel sign 
correlation board. A 68040 processor is used to 
control the operation of the LOG convolver, sign 
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correlator, and a motion controller for the pan-tilt- 
verge head. 

We have selected several goals for our stereo 
vision system which are intended to provide the 
greatest benefit to the robotics projects which we 
support. The list is representative of the goals 
described by the vision community in general 
which includes: tracking, object size and shape 
estimation, obstacle detection, and object 
recognition. In pursuit of these goals, our 
approach is largely based on real-time active 
vision principals. In addition, we are concerned 
with computational economy, modularity, 
reusability, and man-machine interaction. 

Motlon-Centrold Tracking 

Originally, our PRISM system included some 
application software for performing differential 
motion tracking. The tracking algorithm which 
was provided computes a monocular disparity 
vector, for a single image patch, from 
consecutive monocular frames (temporal 
disparity). The disparity vector (motion vector) is 
then used to update pan-tilt velocity control 
parameters. These parameters are fed to the 
motion controller which keeps the tracked object 
in the field of view. Vergence is maintained 
through stereo disparity matching and is 
centered about the same location as the motion 
correlation window. 

This scheme enables the system to track an 
object for a surprisingly long time (due in-part to 
the accuracy gained by the subpixel disparity 
measurements made possible through sign 
correlation) considering that disparity errors 
accumulate with each frame with no mechanism 
for re-centering on the object. Still, these errors 
eventually accumulate to the point where the 
attentive mechanism "slips off” an occluding 
contour of the tracked object and "gets lost" on 
the background. This scheme also proves to be 
very fragile when faced with rotating objects. 

Since the differential tracking algorithm is 
designed to tap the strengths of PRISM, its 
speed is adequate, but it lacks the robustness 
required for our purposes. Therefore, we 
developed an algorithm which combats the 
effects of rotation and integrated disparity error 
by re-centering on the object. After the motion 
and range disparity measurements, there was 
only cibout 1/5 of a frame period left for centering 
oriented measurements; thus it was necessary 
to stay true to our goal of computational 
economy. 


The centering algorithm (CA) we developed is 
simple yet robust. It takes advantage of the fact 
that the differential tracker typically maintains the 
object of interest within a shallow stereo disparity 
range. The advantage to limiting disparity 
measurements to a shallow range is that it 
greatly reduces the number of correlations 
necessary to search a given region of space. A 
determination of occupancy can be made quickly 
by checking one of these shallow regions for 
high (hits) or low (misses) correlation. The 
centering algorithm employs a grid of shallow 
disparity measurements (disparity grid) which 
can then be interrogated to find an approximate 
area centroid for the object. The CA simply 
biases the pan-tilt velocity command towards the 
centroid of the object acting, in effect, as an 
integral control term with respect to the 
differential tracker. 

The CA described above is almost qualitative in 
nature. Each measurement is binary, and a 
number of binary results are used to drive the 
attentive mechanism. This approach provides 
several advantages: each measurement is 
relatively low cost allowing for several 
measurements per frame; a number of 
measurements are averaged making the system 
insensitive to noise; and the measurements are 
well distributed, providing sufficient information 
to track even sparsely textured objects. 

As an object is tracked, its size and shape may 
change as its relative pose changes, revealing 
different portions of the object. In order for the 
centroid tracking algorithm to be general, it had 
to be designed to normalize with respect to such 
image transitions. Normalization is achieved 
through a low cost analysis of the occupancy 
distribution within the disparity grid. If the "hits" 
and "misses" are homogeneously distributed 
then the grid size is increased. If there are only a 
few "hits" and they are localized, then the grid 
size is decreased . The resultant behavior is that 
the grid constantly seeks to match the scale of 
the object K is tracking, a necessary precursor to 
maintaining center. 

A beneficial side effect of the grid size 
normalization mechanism is that the grid tends 
to expand as much as possible, thus it attempts 
to engulf the larger, more track-able portions of 
an object. For example, if a motion seeking 
algorithm "wakes up" the tracker when it is 
fixated on a person's hand, the disparity grid will 
tend to move the fixation point up his arm and 
eventually to his torso. Once fixated on the torso. 
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the tracker is very robust and it is difficuit for 
someone to "lose" it. 

Finally the disparity grid depth of field (defined in 
disparity space) had to be normaiized with 
respect to the object's distance. Otherwise, whiie 
tracking an agent which strays far away, the 
expanded depth of field may allow the object to 
blend into the background. This also requires 
little more than a single trig, computation. 

Depending on the task at hand, the centroid 
tracker can be infiuenced to fixate on specific 
regions of an object. For example, when tracking 
a human, a small velocity control bias in the +Tiit 
direction causes the tracker to reach equiiibrium 
on the head rather than the torso. We expect this 
skill to come in handy in the future when we will 
need to segment out the head and/or face for 
recognition purposes. 


Depth Tracking 

Centroid biased motion tracking has proven 
effective, yet expensive, in order to find the 
displacement of a correlation window between 
consecutive frames, it is necessary to tesseiiate 
(in effect) an area of the image with correiation 
windows. For animate objects such as humans, 
anticipated accelerations require a large array of 
correlations. Stereo disparity measurements, 
however, require only a one dimensional 
search, thus they are relatively inexpensive. 

In many instances, disparity segmentation aione 
is adequate for locating an object. For this 
purpose a depth tracking aigorithm was 
developed which again makes use of disparity 
regions. This algorithm utilizes a pyramid 
approach to search for a distinctive object by 
segmenting with respect to depth and cyciopean 
axis proximity. Again, the object centroid is used 
as the reference location. For tracking spatiaiiy 
distinct objects, the performance of the depth 
tracking algorithm (implemented on the PRISM 
system) is astounding, in fact, in this 
configuration, tracking accelerations are iimited 
by the mechanics/controi of the pan-tiit-verge 
head rather than the computational speed. 
Additional visual cues would be necessary to 
make this approach to tracking viable in cluttered 
environments. 


Contour Tracking 

For determining the shape and size of objects 
which cannot be contained within the camera 
field of view, it is useful to be able to trace, or in 
a more dynamic sense, track its contours. To 
achieve this, the author has developed a 
mechanism which employs methods similar to 
those used for tracking object centroids. The 
difference is that instead of avoiding object 
boundaries, it is desired that the disparity grid 
straddle them. If the system can be made to 
seek out and "stabilize on contours", then 
determining the "sense" of the tangent vector 
needed to track along the contour is a trivial 
matter. 

In order to develop a contour stabilizing 
algorithm using stereo disparity, it is necessary 
to contemplate the depth transitions 
characteristic of these contours. The depth 
gradient of a contour may be positive, zero, or 
negative with respect to the stereo coordinate 
frame. 

Occluding contours have special characteristics 
which readily lend themselves to stable tracking. 
A disparity grid of measurements straddling an 
occluding contour will have a distinct cluster 
(dark area) of "hits" on an object and a distinct 
region (light area) of "misses" off of the object 
edge. An algorithm which employs competing 
"forces" acting to keep these areas in balance 
has proven stable even when limited texture was 
available. Contour tracking is induced by moving 
along the perpendicular bisector of the dark and 
light regbns. 

The remaining question is "how to update the 
median disparity depth of field as the contour is 
traversed". This question is key because the 
disparity depth of field must be appropriately 
biased to ensure that it always encompasses the 
occluding edge. Failure to meet this criteria will 
not ensure that the occluding contour will be 
tracked accurately as its slopes towards or away 
from the vision reference frame. For occluding 
contours, the appropriate bias is found by 
averaging disparities near the perpendicular 
bisector. The trend indicated by the depth 
differential between these points, and points on 
the interior of the edge, provide a clue about the 
slope of the occluding contour. The disparity 
depth of field must be biased in the direction of 
this slope. 

This scheme has proven to be very successful 
for tracing out distinct objects (spatially 
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separated from other objects) including 
cardboard boxes (poorly textured), a robotic 
manipulator (wires and all), and several humans. 
The algorithm, as implemented on the PRISM, 
includes some dynamic "optimization" which 
nwdulates the head control velocity based on 
contour detectability and smoothness. The 
system can now trace out an average sized 
human, standing seven feet away, in about 
twenty seconds, although, it currently cheats in 
order to find closure when it reaches ground 
level. 

Interior contours present a slightly more difficult 
depth segmentation problem and therefore 
present a greater challenge for gaze 
stabilization. Several approaches to real-time 
interior contour tracking are under development. 

Conclusion 

The benefits of using local disparity regions for 
coarse grain depth segmentation has been 
discussed. Several successful stereo vision skills 
which utilize these measurement techniques 
have been described. 

Our PRISM system is scheduled to be integrated 
with a Cybermotion platform. We intend to use 
the visual skills described above for guiding the 
robot to follow humans, walking at a natural 
pace, through cluttered environments. 
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Abstract 

We present a hard real-time software architec- 
ture whicli enables two kinds of safety within 
a single system: the safety of flexibility and 
robustness, and the safety of guaranteed tim- 
ing. The architecture combines a) online intelli- 
gent synthesis of courses of action, with b) pre- 
cise timing and very high speeds in the perfor- 
mance of such automatically synthesized plans. 

A fundamental component is a negative feed- 
back loop from available computing resources 
to computational demands, which ensures both 
that online decisions are feasibly performable, 
and t hat the available computing resources are 
used lo best advantage. This feedback loop al- 
lows the architecture to respond to degraded 
sensor systems by scheduling alternative com- 
putations without accidentally impairing the 
timing of other tasks; allows to respond to de- 
graded computers by scaling back system activ- 
ities to maintain minimal standards of perfor- 
mance, e.g. safety; and allows to dynamically 
exploit excess computing power for improved 
])erformance. 

The architecture, though still subject to limits 
on the achievable robustness, can nevertheless 
be expected to out-perform systems in which 
computer power is allocated off-line. We briefly 
describe two robotic subsystems which can not 
be built safely without our architecture.^ 

1 Motivation 

1,1 Two Control System Problems 

Some NASA applications, such as the intelligent man- 
agement of hardware faults in Space Stations and space- 
craft, and the use of intelligent mechanisms (e.g. robots) 
about the Space Station and on Mars, require both flex- 
ibility and hard real-time execution. Two cases in point 
are the intelligent control of ‘‘redundant'’ robots and the 
dynamic reconfiguration of perception processing, both 
of which we are currently implementing. 

H'opyright (c)]99.3 by Marcel Schoppers. Published by 
the American Institute of Aeronautics and Astronautics Inc. 
with i>erinission. 


With only 6 degrees of freedom a robot arm can 
reach any posture in exactly one way, and this sim- 
plicity makes mathematical motion analysis easy. The 
NAS A/ JSC robot called the Extra- Vehicular Activity 
Helper/Retriever (EVAHR) has 20 degrees of freedom: 
7 in each of the two arms, and 6 in the body. As a result 
there are an infinite number of ways for the EVAHH to 
get one hand to a particular posture. I'he extra (“redun- 
dant”) degrees of freedom greatly complicate the motion 
control problem. Our approach to the control of such a 
“redundant” robot takes dynamical limits into account, 
validates the robot’s expected motion, and also allows 
it to start moving immediately. This is accomplished as 
follows. 

(1) Our Artificial Intelligence (AI) software uses a (piali- 
tative kinematic model to choose a sequence of postures 
for the robot to achieve [Jung and Badler, 1992]. This 
sequence of postures deals with robotic “redundancy” in 
roughly the way humans would: to reach under a bed, 
one gets down on one’s knees, rests on all fours, lowers 
one’s head, and stretches out one arm. Each interme- 
diate posture is then treated as an attractive potential 
field. The attractive fields, together with their re|uilsive 
counterparts (obstacles) drive motion dynamics in the 
usual way (after [Khatib, 1986]) thus striving to avoid 
collisions and ensuring that the motions are compable 
with the robot’s force and torque limitations. A super- 
real-time motion pre viewer then uses these potentials to 
preview the resulting motion before it is performed, and 
so verifies that the motion will avoid botli collisions and 
potential wells (local minima). 

(2) For this scheme to work, there must be a careful 
synchronization between trajectory previewing and the 
actual motion, lest the actual motion get into space Uiat 
has not been previewed, with potentially dangerous con- 
sequences. Indeed, for given motion speeds and acceler- 
ations the previewer must be executed with a frtHpn'iicy 
lying within a limited band [Schoppers, 1996]: if the pre- 
view^er is executed too frequently it won't have lime to 
preview^ a fail-safe trajectory; if it is executed too infre- 
quently the robot can accelerate ahead of it. 

The preceding two paragraidis taken togf'ther signal 
trouble. Paragraph (1) says that we need an Artificial In- 
telligence (Al) program to decide, on-line, how the robot 
should move, by choosing a setjnence of postures. Para- 
graph (2) says that mot ion- related computations must 
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he carefully timed. These two requirements have been 
incompatible for many years. 

(3) Our second robotic subsystem provides a new degree 
of system robustness and survivability for robotic vision. 
We take advantage of the presence of several types of 
vision sensors on the EVAHR robot by dynamically se- 
lecting available sensors and perception algorithms, thus 
improving the probability of system survival despite sen- 
sor malfunctions. 

(4) Robotic vision is part of the pipeline from sensor in- 
put through state estimation to motion control. Since 
data reaches the robot's effectors some time after the 
data was first sensed, the control system must compen- 
sate for the delay by predicting where things will be 
when the intended actions are finally performed or com- 
pleted. If any part of the sensor-input- to-action-output 
pipeline is mis-timed, the control system’s predictions 
will be wrong and damage may result. 

Here too, the combined implications of paragraphs (3) 
and (4) are problematic, but for a different reason. Per- 
ception can be implemented without resorting to AI al- 
gorithms; but because the computing time required to 
interpret different sensors differs widely, dynamic recon- 
figuration effectively makes the computing time of the 
whole perception system unpredictable. Once again, the 
conflict between system flexibility and real-time execu- 
tion is holding up an otherwise good idea. 

1.2 The Core Problem 

It is a very difficult problem to implement systems that 
are at once flexible in the sense of adaptivity and robust- 
ness, and hard real-iime in the sense of timely execution 
of actions. To show why this combinations of require- 
ments is so difficult, this subsection presents a careful 
analysis of each requirement. 

“Flexibility” in a machine is as apparent as its make- 
up is subtle. The dark background against which flexi- 
bility stands out in sharp relief is the problem of domain 
complexity. We explain this with reference to automated 
diagnosis. In principle, diagnosis could be done by com- 
piling a large decision table that associated combinations 
of symptoms with faults. As the number of possible 
faults and symptoms grows, however, the number of pos- 
sible combinations of faults (and symptoms) grows astro- 
nomically. The programmer is inevitably overwhelmed 
by the complexity of the domain, and so fails to antici- 
pate some of the possibilities and to properly understand 
others. The result is software that makes false assump- 
tions and sometimes behaves inappropriately. Flexibil- 
ity, then, stands out as the ability to go on behaving 
appropriately even when the number of possible situa- 
tions grows beyond the reach of human forethought. 

The fundamental justification for Artificial Intelli- 
gence (AI) algorithms is their flexibility as just defined. 
Flexibility may be necessary for many reasons, includ- 
ing: the environment may be unpredictable, or sub- 
systems may malfunction, or the environment may be 
entirely predictable but very complex. In such cases, 
AI algorithms can do on-line the problem solving that 
could (in principle but not in practice) have been done 
by a programmer off-line. Not only are AI algorithms 


more economical in terms of programming effort, they 
are also safer because the responses devised by the A I 
software can take dynamically occurring factors into ac- 
count more thoroughly, and the resulting system is likely 
to generate appropriate behavior for a iiiuch wider vari- 
ety of the possible situations. 

We now turn to the requirement of real-time execu- 
tion. “Hard real-time” execution is necessary whenever 
hardware must be controlled safely. In general, a compu- 
tation is called “real-time” when the correctness or oilier 
value of a computation depends not only on what daia 
or action the computation produces, but also on the time 
at which that data or action is produced. For example, 
there is a particular moment in time beyond which any 
computing about collision avoidance is no longer useful 
because a collision is no longer avoidable. More often 
than not, the timing of computations is a critical factor 
in total system safety. The field of hard real-time com- 
puting research is working on ways of guaranteeing that 
the timing of computations is correct. 

It would be nice if we could build systems that were 
both safe on account of the flexibility built into them 
with AI algorithms, and doubly safe because their com- 
putations were guaranteed to be completed on time with 
the available computing power. But this coujunction is 
a difficult one. The pivotal nroblem was first clearly de- 
scribed by [Paul et al, 1991] as follows: 

The timing of actions taken by a real-time sys- 
tem must have low variance, so that the effects 
of those actions on unfolding processes can l>e 
predicted with sufficient accuracy. But intelli- 
gent software reserves the option of extended 
searching, which has very high variance. 

Because AI algorithms — like people solving dilRcult 
problems — generally reach conclusions by making plau- 
sible guesses that may turn out to be wrong any num- 
ber of times (“searching”), their total execution time is 
highly unpredictable. To make ma tters even worse, A I 
software in control applications often dynamically varies 
the tasks being carried out, thus dyiiamically changing 
the structure of the computation. When there are n com- 
putations a system could execute, there are 2’’ |>ossil)le 
subsets, each with its own execution time, so that again 
the total system’s computational load is highly variable. 
Both sources of unpredictability make it very difficult 
to be sure that the total system can perform in hard 
real-time. 

Thus it was necessary, until recently, to choose l>e- 
tween the safety provided by hard real-time performance, 
and the safety provided by on-line automated decision 
making. Systems such as intelligent robots or space sta- 
tions, which require both, were specifically beyond (he 
state of the art. 

1,3 A General Solution 

We have undertaken to provide both kinds of safety si- 
multaneously by means of a novel software architecture. 
Our architecture, which has been worked out in detail 
but not yet implemented, exploits a specific set of hard- 
real-time techniques that have become available only in 
1992 and 1993. 
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Previous attempts by others to integrate hard real- 
time performance and on-line automated decision mak- 
ing were of two kinds. One tried to force A I software 
into a real-time mold by limiting on-line searching, thus 
also eliminating most of the flexibility. The other al- 
lowed the AI software to compute as long as it liked, 
and tried to design the real-time part of the system to 
keep the whole system out of trouble until (indefinitely 
much later) the Al software communicated a decision to 
the real-time subsystem. This allowed flexible behav- 
ior until the real-time part of the system stumbled into 
unfamiliar territory with the AI software still thinking 
about the past. 

Our approach is an elaboration of the circa archi- 
tecture [Musliner et al, 1993]. The AI software doesn't 
merely send new parameters to a fixed real-time sub- 
system, it dynamically reprograms the whole real-time 
subsystem, and simultaneously plans total system behav- 
ior to ensure that the real-time program will be able to 
cope with the chosen future. This produces such inter- 
esting phenomena as robots slowing down to ensure that 
their real-time programs will not be overloaded — the 
AI software now can shelter the real-time subsystem and 
can also buy itself time to think. In short, our approach 
imposes a negative feedback loop from exce.ssive compu- 
tational loads to less demanding system behavior. Fur- 
ther, the real-time subsystem is no longer cast in stone 
but can be made to adapt to its context, and to perform 
procedures that were automatically constructed. 

We were careful to design our software architecture 
in an application independent way, so that the solution 
would be suitable for use in everything from robots, to 
space stations, to interplanetary spacecraft, to Lunar 
and planetary bases — with a promise of enhanced safety 
in all cases. 

Having resolved the central flexibility/real-time con- 
flict, we intend to demonstrate the great value of our 
software architecf.ure by having it enable two novel ap- 
idications, namely the dynamical control of robots with 
many degrees of freedom, and the dynamic reconfigura- 
tion of a multi-sensor fusion subsystem. Our sul)system 
for recon figurable multi-sensor fusion will dynamically 
choose whatever sensors and perception algorithms it 
likes, and the robots physical behavior will adjust to the 
dynamically changing computing load. Our subsystem 
for intelligent control of the motion dynamics of robots 
with many degrees of freedom will be a large advance 
on current robot control technology. It will rely on our 
general software architecture for proper timing of motion 
previewing relative to actual motion, and for hard real- 
time computing in the presence of AI posture planning 
algorithms. Both of these applications are impossible to 
implement safely without our software architecture. In 
enal)ling feedback from computing load to safe behavior, 
our software architecture will automatically and dynam- 
ically determine the maximum speed at which a robot 
can safely move, even when some of the robot's comput- 
ers and/or sensors are malfunctioning or disabled. As 
a result our architecture will also support sensor pro- 
cessing reconfiguration in a completely safe and general 
way. Both modules will demonstrate the value of our 


architecture for improved adaptiveness and survivability 
of complex control systems. 

1.4 Our Approach In Perspective 

The Artificial Intelligence (AI) community has come to 
address the requirements of real-time systems in three 
ways [Durfee, 1990] . When building a system that must 
act in real-time as well as reasoning, one can: 

• Subject the AI component of the system to hard 
deadlines (e.g. anytime algorithms). Under time 
pressure, this results in truncation of intelligent 
function. 

• Allow the Al component to think freely, make the 
real-time subsystem responsible for total system 
safety, and have the Al component re-j)arameterize 
the real-time subsystem with whatever guidance the 
AI subsystem can produce in time. Under time pre.s- 
siire, this results in intelligent function l>eing left far 
behind the rush of events. 

• Refuse to subject the A I component of the system 
to hard deadlines, but let the A I component dy- 
namically reprogram the real-time subsystem with 
a program realizing a discrete-evnit control law that 
preserves closed-loop stability with sufficient jvhust- 
ness for the period of time tn ivhich the A I subsystem 
is deciding what to do next. This approach remains 
functional even under time pressure — almost l>y 
definition. 

We regard the NASREM architecture [Albus et al, 1987] 
as embodving the first approach, and the architeriures of 
Bonasso [Bonasso, 1991; Bonasso and Slack, 1992] an<l 
Gat [Gat, 1992] as embodying the second. We favor 
the third, in which we have imposed control- theoretic 
criteria upon the ideas of [Musliner et al, 1993]. The 
resulting architecture has the following advantages: 

1. the AI software can remain intelligent and flexible; 

2. the real-time subsystem need not be progranuiKMl 
(at system design time!) to be competent against 
all possible contingencies; 

3. the A I software can reprogram the real-time sub- 
system to prevent computing overloads before t hey 
happen (e.g. with slower motion), thus pro-actively 
buying itself time to think; 

4. the program down-loaded into the real-time sub- 
system can be as flexible as the Al software can 
make it (e.g. in a robotic a|>plication the maxi- 
mum safe speed can now be a function of compu- 
tational load, and so will l>e higher than the worst- 
case limit nearly all the time, allowing sul)stantial 
performance gains); 

5. the real-time subsystem can be a small operating 
system, time-slicing tasks or threads with widely 
differing frequencies, in contrast, to the many mo- 
bile robot controllers in which all computations are 
executed with the same frequency. 
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2 Architecture for Flexible Real-time 
Control 

2.1 Description of Architecture 

Figure 1 shows the architecture we have designed on the 
preceding foundations; this Figure will serve as a guide 
for the ensuing discussion. 



Figure 1: Architecture Diagram. 


There is assumed to be a fixed control program which, 
in our architecture, is split into two parts. One part, 
usually called the control law or plan, is modified so 
that its execution or interpretation affects only a. simu- 
lation model of the controlled system. The other part 
determines where simulation should begin and end, and 
is called the “plan simulator’'. 

The basic idea is that the plan simulator chooses a set 
of initial and goal states, simulates the plan or control 
law over several possil)le futures, collects the real actions 
appropriate to the anticipated states of the controlled 
system, and so dynamically constructs a small real-time 
program, which it downloads to the real-time executive. 
From then on, the real-time executive takes care of com- 
municating with the decoupler, sensor readers, and de- 
vice controllers, in hard real-time. This mitigates time 
pressure on the plan simulator, which can proceed with 
the construction of a new real-time i)rogram. This sepa- 
ration of timing concerns is indicated in the architecture 
diagram by the dividing line between soft real-time and 
hard real-time. 


Each new real-time program specifies: a set of func- 
tions implementing actions on the controlled system: a 
set of functions which test the estimated state of the con- 
trolled system; the conditions under which each action is 
appropriate; and maximum allowed delays between sat- 
isfied conditions and appropriate reactions. 

Tn practice, each real-time i>rogram is a set of tasks or 
threads^ that can be run, suspended, resumed, and so 
forth, under control of the real-time executive. Threads 
may be periodic, being executed cyclically and with a 
definable frequency; or they may require execution only 
sporadically (e.g. interrupts), in which case there may be 
a maximum allowable delay between the time the thread 
becomes relevant and the time it is actually executed. 

The real-time executive comes with a “schedu I ability 
check” designed specifically for that real-time executive. 
It is used by the plan simulator to test whether each new 
real-time program is indeed executable in hard real-time. 
A successful schedulability check absolutely guarantees 
that no thread will miss any deadline specified for it. 
A failed schedulability check tells the plan simulator to 
design a less demanding real-time program, and if nec- 
essary, to choose a less demanding future. 

The “actions” performed by the real-time executive do 
not themselves drive effectors or sensors. Instead, they 
send on/off signals and parameters to the effectors’ and 
sensors’ servo/dri ver loops, which often run on dedicated 
microprocessor chips. 

Just as servo loops may take extended amounts of time 
to reach a setpoint, task-specific A I modules may take 
extended amounts of time to make a decision. Just as 
the real-time executive is controlling sensor i-motor servo 
loops by updating their parameters and turning them 
on and off, it also turns task-specific A I modules on and 
off according to the needs of the moment. That’s why 
Figure 1 shows a typical task-specific A1 module below 
the real-time executive, with the device controllers. 

The real-time executive, plan simulator, servo loo|)s, 
and all A I computations are threa<ls which operate in 
parallel by default. The fraction of computing i>ower 
(CPU cycles per second) available to each threaf.l is 
carefully decided by the plan simulator and enforced by 
the real-time executive. That is, the real-time |>rogram 
constructed by the plan simulator may in elude actions 
that adjust (even to 0) the frequency with which cert ain 
threads are executed. 

As a special case, the real-time program may set even 
the plan simulator’s execution fre(|uency to 0. Even 
though the real-time executive depends on the now- 
stopped plan simulator for updat^^d real-time ]^rograms, 
this situation does not mean that the system is perma- 
nently stuck with the existing real-time program, since 
the existing real-time program is conditional, and W'ill 
have been designed so that new circumstances cause the 
execution of actions that allow’ the plan simulator t o !>"- 
Slime. (If the real-time executive’s control over A I mod- 
ules is reason for Figure i to show the latter l.)elow the 

^A thread is a function that has its own piece of program 
stack, so that execution of the function can he suspended 
and resumed independeutly of the execution of otJier similar 
threads. 


771 









forniei, perhaps the plan simulator should also be shown 
below the real-time executive.) 

Executed as “background'’ threads, AI modules can 
take as long as they like, and provide the system with 
t he full power of on-line intelligent reasoning without en- 
dangering the operation of time-critical system elements. 
At the same time, the plan simulator can ensure, by 
proper design of the real-time program, that such AI 
modules are not left behind the rush of events: if the 
decision of an AI module is crucial to future activities, 
the |)lau simulator can program the real-time executive 
to take actions that delay the on-coming future. This re- 
sembles the idea of designing a “reactive subsystem” to 
ensure total system safety while the AI software thinks 
(see above), with the important differences that in our 
case (a) the “reactive subsystem” is not fixed at system 
design time, and (b) the AI software’s thinking may be 
aborted and redirected by new events. 

2.2 Control Theoretic Requirements 
In our architecture. 

The cautomatically constructed real-time pro- 
gram is a partial control law. The real-time ex- 
ecutive, in its performance of that control law, 
functions as a controller. 

The hardware devices being controlled will in general 
have dynamical behavior. Ideally we would like the real- 
time program to take care of all hard real-time processes, 
so that the plan simulator itself does not have to worry 
about real-time deadlines. To this end we qualify the 
CIRCA apjuoach with a stipulation that the real-time 
program must drive the controlled system into a set of 
goal states, and then must maintain the controlled sys- 
tem within that set of goal states for an indefinite pe- 
riod of time. Such behavior on the part of a controller is 
known as “closed-loop stability.” For the case of discrete- 
event control systems, we define it as follows: 

A discrete-event system is closed-loop stable, 
with respect to a set of goal states, if the con- 
troller is able to drive the controlled system so 
that it actually reaches a goal state in finite 
time, and then (in the absence of disturbances) 
stays within the set of goal states indefinitely. 

(Of course, along the way toward a goal state the real- 
time program may also decide to do nothing for a while, 
knowing that a device will do a desirable thing without 
being forced to do so.) In sum, we require, as a key 
property of our architecture, that 

Each automatically synthesized real-time pro- 
gram (discrete-event control law) must make 
the controlled system closed-loop stable. 

When the controlled devices are subject to distur- 
bances (i.e. dynamics that are unpredictable) the dis- 
turbances may throw the .system out of the set of goal 
states, and we expect the real-time program to drive the 
controlled system back to a goal state. This too is a 
familiar notion in control theory: 

A control law is robust to the extent that it 
can keep the controlled system closed-loop sta- 



Figure 2: Controlling via Stable Subdomains. 


ble despite the occurrence of disturbances and 
unmodelled dynamics. 

Thus we desire that our automatically .synthesized real- 
time program should be as robust as possible. 

Figure 2 depicts the plan simulator's repeated revision 
of the real-time program, along with the latter's efiect 
upon the controlled system. The closed- loop stability of 
the controlled system is represented l)y the “attractor 
states” into which the system eventually settles. The se- 
quence of attractor states represents the plan simulator's 
repeated selection of goals. The robustness of each real- 
time program is represented by the fact I hat closcd-loo|) 
stability is achieved despite unpredictable ineand^uings. 
The meanderings are rectangular rather than smooth, to 
emphasize that the real-time program encodes a discreie- 
event control law, not a continuous-variable control law. 

In constructing a real-time program, the plan simula- 
tor must choose goals and must antici|>ate enough pos- 
sible futures to achieve closed-loop stability and robust- 
ness. In practice, in discrete-event control applications, 
finding a set of goal states that allow closed-looi) sta- 
bility is much easier than in continuous- variable control 
applications. Chips can be made to stay on tables, doors 
can be made to stay open, buildings can be made to 
last, and so on. In AI the standard counter-examples 
to such stabilities are “spontaneous proces.se.s” and the 
actions of other agents, but remember that for our |iro- 
posed architecture to work, the real-time program need 
only be closed-loop stable. This means that the devices 
being controlled can act to prevent unwanted interfer- 
ence. Thus the real-time program could prevent other 
agents from closing doors it wanted kept open, ami so 
forth. Under these conditions, discrete-event closed-loop 
stability is not hard to find. 

Achieving robustness is more dilficult, especially for 
discrete-event control applications. The problem is that 
unpredictable or unmodelled disturbances may throw 
the controlled system into so many different slates that 
it may be impossible to synthesize a real-time program 
that can respond to all of them within tlie (TU and/or 
memory resources provided by the real-time executive. 
Consequently, making a discrete-event control law ro- 
bust is a fine art. The best that can be done is to include 
“important” disturbances within the domain model, and 
further, to include into the model such information as 
the likelihood, outcome, and severity of a possible dis- 
turbance. The availability of that information will allow 
the plan simulator to decide on-line which disturbances 
should be anticipated by the real-time control program 
being constructed, in order to make that program as ro- 
bust as possible within the available resource limits. 

The properties of closed-loop stability and robustness, 
which must hold of the real-time program being down- 
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loaded to the real-time executive, are shown in the ar- 
chitecture diagram (page 4) as being the responsibility 
of the plan simulator. There we have referred to “any- 
resource"' robustness to indicate that, once closed-loop 
stability has been built into the real-time program, the 
plan simulator may apply whatever resources are left 
over towards improvements in robustness. 

There remains the question of what should happen if, 
desj>ite the construction of a feasibly robust real-time 
program, the controlled system makes a low probability 
transition into a state for which the real-time program 
has no response. A number of solutions are possible. In 
the CIRCA approach that problem was passed back to 
the plan simulator. Since that solution makes system 
stability dependent on the response speed of the plan 
simulator, we distrust it. An alternative is to have the 
plan simulator design the real-time program so as to an- 
ticipate all device states that must be controlled within 
some time horizon, or within hard deadlines; in this way 
the plan simulator will have a minimum amount of time 
with which to build a replacement real-time program, 
and will need to deal only with soft deadlines. This al- 
ternative works only if the plan simulator's model of the 
domain dynamics is correct. Our proposed solution is 
to have the real-time program include alternative sub- 
programs, along with instructions for their contingent 
deployment. This solution can work if the real-time ex- 
ecutive’s execution-time resource is more constraining 
than its memory space resource — and that is likely. 

2.3 Relation to 3-Level Architectures 

The architectures of [Firby, 1989], [Boiiasso, 1991; 
Bonasso and Slack, 1992] and [Gat, 1992] specify three 
“levels” , namely 

1. a “reactor” containing servo loops, safety reactions, 
and behaviors; 

2. a “sequencer’- for deciding which specific activities 
are needed both now and in the near future; 

3. a “deliberator” which contains AI software for plan- 
ning, diagnosis, metalevel reasoning, and so forth. 

These three levels resemble our hard real-time executive, 
our soft real-time plan simulator, and our non-real-time 
AI modules, respectively. None of the cited authors took 
timing seriously, however. In the reactors of their mobile 
robots, reaction threads for avoiding collisions run every 
so often, but there is no guarantee whatever that collision 
avoidance will always be performed in timely fashion. 
Their programs may work properly for very long times, 
but ultimately it is impossible even to know whether the 
robots have ever endured worst-case logjams of threads 
competing for execution time. Hence we have replaced 
their reactors with our hard real-time executive, which 
takes timing very seriously indeed. 

A second important difference is we have changed the 
locus of control. In both [Firby, 1989] and [Bonasso 
and Slack, 1992] control was hierarchical: the deliber- 
ator was on top and in control, deciding what the se- 
quencer should do, and the sequencer decided what the 
reactor should do. This was changed by [Gat, 1992], 


who put the sequencer in control and reduced the delib- 
erator to computing task-specific parameters like motion 
trajectories. The major reasons for this were that (1) re- 
action should not be kept waiting for deliberation, and 
(2) old deliberations may be irrelevant to new situations, 
so the sequencer should get to initiate and terminate de- 
liberations. The locus of control changes again for our 
architecture: we are moving it all the way down into t he 
reactor (a.k.a. the real-time executive). The reasotting 
for this is that the results to be computed <.lepend on the 
situation at the time; the amount of computing power t o 
devote to hard real-time, soft real-time, and background 
threads depends on what needs to be computed, and 
hence on the situation at the time; but the situation can 
change very rapidly, hence the allocations of computing 
power and the threads being executed may also have to 
change very rapidly; and so the changing allocations and 
the initiation and termination of all threads (including 
deliberations) must be handled by the real-time exec- 
utive. ([Kaelbling and Rosenschein, 1990] also have a 
fast reactive system determining which deliberations are 
relevant and for how long.) 

Third, our real-time executive is a real-time micro- 
kernel running threads at multiple frequencies, not a sin- 
gle loop that performs a fixed list of functions all at the 
same frequency. 

3 Implementation of the Real-time 
Executive 

Any real-time subsystem adopted as the basis of our ar- 
chitecture should provide the following minimum capa- 
bilities: 

Rl: insulating the processing time allocated to hard 
real-time threads from the cycle-stealing desires of 
all other threads. 

R2: testing, as part of program execution ami without 
modifying the set of currently executing hard real- 
time threads, whether a proposed new set of hard 
real-time threads is such that all its deadlines can 
be met with the available processing time. 

R3: conditionally switching I hread subsets on and off, so 
that the real-time executive is effectively executing 
a conditional real-time program. 

R4: allowing dynamic modification of the hard real-t ime 
program (the set of hard real-time threads and the 
switching logic) while still meeting all deadlines be- 
fore, after, and during the modification. 

R5; ability to do R1-R4 when the set of hard real-1 ime 
threads contains both cyclic (periodic) and sporadic 
(aperiodic) threads. 

These requirements can be met in a variety of ways. 
The alternatives are 

1. multiprocessor schedulers, which are the most pow- 
erful but expensive in proportion; 

2. earliest-deadline-first schedulers, which are the niosl. 
user-friendly; 

3. rate monotonic schedulers, wdiich are the easiest to 
build. 
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We believe that our architecture could be implemented 
equally well on top of any of these options. However, 
the i)udgetary constraints on our work incline us away 
from multiprocessor schedulers (while yet we also be- 
lieve that for such complex systems eis space stations or 
lunar bases, multiprocessor schedulers are by far the best 
choice). The best fit with our work on the EVA HR robot 
is an earliest-deadline-first scheduler, since such sched- 
ulers facilitate dynamically changing thread frequencies 
(for synchronizing computing with robotic motion) and 
conditional schedules. Our intejided implementation is 
similar to that of [Ramamritham and Stankovic, 1984] 
with the schedulability check of [jeffay, 1992]. 

4 Discussion 

4.1 Benefits of the Architecture 

The final justification for our architecture consists of 
three words: safety, generality, and economy. Safety: 
because the timing of program execution is taken so se- 
riously that even intermittent timing problems can not 
survive, and because the architecture establishes a feed- 
l>ack loop from coniputiiig load to graceful degradation 
of task performance. Generality: because the system 
can contain on-line intelligent reasoning, can enact the 
results of such reasoning in hard real-time, can dynam- 
ically replace the system s hard real-time reactions, and 
can judiciously control its own use of computing power. 
Economy: because a fixed amount of computing hard- 
ware can be time-shared and carefully allotted. Here we 
elaborate on just a few of these benefits; the complete 
list appears in [Schoppers, 1993]. 

♦ Spatial synchronization. The ability to dynamically 
modify the frequency of thread execution allows to 
synchronize computing with motion through space. 
With collision checks being made at regular dis- 
tance intervals, slower motions require less calcula- 
tion. Knowing such facts about itself, an intelligent 
real-time system can reduce the speed of its mo- 
tions for the sake of reducing the processing power 
devoted to motion- related threads. 

♦ Graceful degradation. The ability to scale back its 
operations as necessary to ensure timeliness elimi- 
nates the need to design to an imaginary worst case 
scenario, because there is no longer a sharp perfor- 
mance cliff that the system can fall off in unpre- 
dict ably disastrous ways. 

♦ Reconfigurability. A survivable system must have 
several ways of achieving the same result. When the 
sensor normally used to deliver a given datum mal- 
functions, another can be used. Since the computing 
time recpiired to interpret different sensors differs, 
such result-compatible reconfiguration is only safe 
in systems (like ours) that can plan their behavior 
to match their planned computing load. 

4.2 Limits of Robustness 

Despite our concern for hard real-time and for dynam- 
ically achieved robustness, some kinds of mishaps can 
still happen (of course). If enough sensors malfunction, 


a robot will be unable to see new dangers approaching, so 
cannot be held responsible for avoiding them. Similarly 
a robot may, for the sake of getting its job done, have 
to place itself in situations that would be dangerous if 
the robot's computers suddenly died. In all other cases, 
however, including robotic inability to go on sensing ol>- 
jects it already knew about, as well as comi>iiter failures, 
our software will be aware of the potential mishaps and 
will continuously and intelligently redesign the robot's 
behavior specifically to optimize first the saOdy, then 
the performance, of the robot in its surroundings. 

4.3 When Is Hard Real-time Important? 

The development of our architecture was driven i>rimar- 
ily by concern for hard real-time response des|>ite the 
presence of A1 software. It is unclear to many people 
why timing should be taken so seriou.sly. The most com- 
mon objections are (1) Couldn't we hand-code a fixed 
layer of real-time reactive behaviors that take care of 
everything (e.g. collision avoidance) while the A I soft- 
ware is thinking, and (2) Couldn't we make sure that 
the real-time software works, by means of a test-debug 
cycle? Sometimes yes, but also sometimes no. 

Objection (1) is usually raised by people who have 
programmed wheeled mobile robots on earth. For such 
robots there is a small repertoire of actions that can en- 
sure robotic safety, e.g. slamming on the brakes or mov- 
ing away from impending collisions. However, as soon as 
either the robot or its environment becomes more com- 
plex, a fixed “reactive safety layer" no longer sulfices. 
One example is the Adaptive Suspension Vehicle (ASV') 
[Payton and Bihari, 1991], which was the size of a l>us, 
with six legs that were each 6 feet high at the hip. Main- 
taining stable balance while keeping the legs away from 
each other and while switching between gaits require<l 
a super-real-time motion planner. For the ASV, even 
stopping was so complicated that no predetermined set 
of “reactions" could have surPiced. Alternatively, more 
complex environments also prevent a hand-written safety 
layer, since such a layer must assume that its actions are 
easily reversible and will not themselves lead to new dan- 
gers. On orbit, however, a free-flying robot's action (o 
avoid a collision will move the robot into a new orl)it 
from which it may be both time-consuming and fuel- 
consuming to return, and on which it is still flying at 
approximately 20,000 miles per hour. In general it is 
not true that all robots can be kept safe forever with a 
fixed set of hand- coded reactions. 

Objection (2) cannot be sustained if a thread's missed 
deadline can lead to loss of human life. Since exi>erienced 
programmers know better than to claim that they've 
found “the last bug", and since concurrent software is 
worse than most, the test-debug approach may well yield 
life-threatening software [Stankovic, 1988]. The risks 
can be diminished by ap|)lying existing hard real-time 
scheduling research. Olyjection (2) is also relnittf'd if a 
system's worst-case computational load is several linu's 
higher than the average load. For example, selting a 
robot's top allowable speed to avoid timing j)robl('ms 
under a rare scenario will also limit the robot's ]>erfor- 
inance at all other times. Our architecture allows to 
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adapt, the robot's top speed to current computational 
loading. Here too it helps to take hard real-time seri- 
ously. 
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Abstract 

In this paper we have discussed some of the 
issues involved in planning utilizing a temporal 
reasoning system. One of the advantages is that 
of being able to handle incomplete information. 
In these circumstances, there may be multiple 
plans available for achieving a task. Using an 
algorithm, designed recently by us, generating 
all feasible plans may be practicable in most 
of the instances, although the problem is NP- 
complete. The significance of having all feasible 
plans in a plan data base is quite important. We 
have also discussed these issues here. 

I. Introduction 

Temporal reasoning is becoming an impor- 
tant tool for planning[4, 5, 11]. Although it 
is not very easy to represent planning problem 
eis a temporal reasoning problem, the advan- 
tage of doing so is immense. In this paper we 
have discussed one of such advantages. Tem- 
poral reasoning allows one to represent incom- 
plete information between operations and other 
primitives in a planning problem. This is a step 
forward from blocks world problem towards re- 
alistic planning. Under such uncertainty, there 
may be more than one plan which is feasible. 
Using an algorithm devised by us[8], all such 

* Graduate Student 
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feasible plans can be found out. The signifi- 
cance of the availability of such a complete plan 
data base is discussed in this article. 

Interval based temporal reasoning scheme 
gives planning activity the advantage of having 
some parallely executable operations. In this 
paper we have used this interval-based tempo- 
ral representation scheme. The price of having 
higher expressiveness in interval-based scheme 
is that the problem of handling incomplete in- 
formation is NP-complete. Our algorithm is an 
efficient one, and under most of the circum- 
stances it should be able to handle the prob- 
lem in acceptable time, although the worst case 
growth rate remains exponential. 

II, Planning and Temporal Reasoning 

Planning is the task of choosing a subset 
from a given finite set of operations and order- 
ing them in time. Each operation has a set 
of precondition and postcondition states of the 
world. Preconditions are states required for an 
operation to become executable, and postcondi- 
tions are states created by its execution. Given 
a set of start conditions and a set of goal con- 
ditions, the problem of planning is to see that 
chosen sequence of operations change the states 
of the world from start conditions to goal con- 
ditions. This perspective of planning is akin to 
a state change-based point of view in tempo- 
ral reasoning, as in situation calculus. A dif- 
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ferent approach had been taken by Allen et 
al[2] about planning where they have tried to 
formalize planning with more explicit temporal 
reeisoning. According to the conventional view, 
temporal identities, both operations and states 
of the world (or fluents) y are related to each 
other in such a way that the end of one is the 
starting point of the other. In interval based 
temporal reasoning scheme this latter temporal 
relation is the ‘meet’ primitive[l]. There are 13 
such other primitive relations feasible between 
two time intervals. 

Classical planning, based on situation calcu- 
lus, suffers from two shortcomings in their ap- 
plication in the real world. Firstly, it can not 
tackle a problem when two operations need to 
be performed at the same time to achieve an 
objective[3]. For example, to open a door one 
needs to turn door knob and push the door at 
the same time. This can not be easily repre- 
sented in conventional planning schemes. Even 
under the circumstances where such parallel op- 
erations are not essential, such plans may exe- 
cute faster (subject to the availability of suffi- 
cient resources for parallel execution of opera- 
tions). Secondly, any incompleteness of infor- 
mation about the real world can not be repre- 
sented in classical planning scheme. For exam- 
ple, one can make a phone call ‘before’ the class 
or ‘after’ the class, this flexibility of informa- 
tion can not be represented there. An interval 
bsised temporal reasoning scheme is capable of 
handling such situations. 

In the interval-based temporal reeisoning 
scheme, each of the fluents and operations are 
considered as an interval in time. They lie on 
a unique non-branching time line, and so, each 
of them is temporally related to all the other 
ones. A single primitive relation as a temporal 
relation between a pair of temporal entities in- 
dicates definite information between those two 
temporal assertions, whereas a disjunctive set 
of primitive temporal relations indicates incom- 
plete information. In the next section, the 
scheme for interval based temporal reasoning 
will be discussed. 


II. Interval-based Temporal Reasoning 

Suppose the following set of information is 
given in the context of petroleum exploration. 
GS = gravity survey 
SS = seismic survey 
DA = drilling activity 
PA = production activity 
ER = enhanced recovery 

Apart from these operations there are following 
fluents which are affected by the operations. ^ 
Surveys are done when there has not been any 
survey (NS) done on the area. Seismic survey 
produces subsurface geological knowledge(GK), 
it also produces huge noise(NO) caused by the 
artificial seismic explosions. Gravity survey can 
not be done during such noise. Drilling ac- 
tivity requires geological knowledge about the 
area, although it may make geological knowl- 
edge invalid. It also makes drilling wells avail- 
able(DW) which are required by production ac- 
tivity. Production activity produces data about 
the reservoir(RD) which is necessary for en- 
hanced recovery. Enhanced recovery is done 
for maximum production from a reservoir(MP). 
Actual temporal relations between them are 
given below. 

1. NS -{finished-by}-> GS ^ 

2. NS -“{finished-by}-> SS 

3. SS -{equal}-> NO 

4. NO -{before, meet, after, met-by}-> GS 

5. SS -{overlap }-> GK 

6. GK -{during-in verse, finished-by, overlaps}- 

> DA 

7. DA -{starts}-> DW 

8. DW -{during-inverse, finished-by, 
overlaps}-> PA 

9. PA -{overlaps, statrs, equal}-> RD 

10. RD -{starts, equal, overlap, meet, before}- 

> ER 

11. ER -{meets}-> MP 


^The information given here about petroleum pro- 
duction is not necessarily realistic. Preconditions and 
postconditions are simplified to a great extent. 

^For exact semantics of the 13 primitive tempored 
relations see[l]. 


Given these constraints, one may need to 
plan the activities for achieving maximum 
petroleum production from a reservoir, a state 
of MP, starting from a state of NS(no-srvey). 
For example, a trivial plan could be a serial or- 
dering of the operations GS, SS, DA, PA and 
then ER. 

Each of those operations or fluents is an in- 
terval in time. Some of the relations may not 
be consistent with respect to the others, A 
temporal reasoner’s primary job is to find out 
whether any consistent scenario is feasible or 
not, by propagating above constraints all over 
the temporal data base. In the context of plan- 
ning, this means whether there exists a plan or 
not. If there exists a temporally consistent sce- 
nario, or a plaUy then finding one such instance 
would be the next task of a reasoner. Formaliz- 
ing planning problem as an interval constraint 
propagation problem, is being addressed to in 
[2]. According to this formalism each interval 
is represented as node in a constraint graph, 
and disjunctive temporal relations as labels on 
the arcs between these nodes. The temporal 
constraint graph (TCN) is a complete graph 
because every temporal interval (we call it as 
Unode) is related to the other t-nodes, even if 
there is no specific information about how they 
are related. This complete lack of information 
is represented as disjunction of all 13 primitive 
relations (termed as tautology) . 

Any temporal constraint propagation algo- 
rithm systematically eliminates primitive rela- 
tions from the labels on arcs, which are incon- 
sistent. A global consistency algorithm^ elimi- 
nates all such primitive relations which can not 
form a consistent scenario for the temporal as- 
sertions in the network. In such case, any prim- 
itive relation on any label can take part in at 
least one consistent scenario. Such a labelling is 
called minimal labelling. If during propagation 
any label gets all its primitive relations stripped 
oflf, having a null relation, then it implies that 
the two end nodes can not have any consistent 

® Globed consistency edgoritlnn checks for consistency 
of constraints aU over the network, in contrast to a lo 
ced consistent algorithm, which diecks for consistency 
of every subnetworks of a fixed size. 


labelling between them. This in turn implies 
the given constraints were inconsistent with re- 
spect to each other, and any consistent tempo- 
ral scenario can not be formed. 

Interval-based temporal reasoning increases 
expressiveness of planning. But the main prob- 
lem with this scheme is that the problem of 
checking global consistency is NP- complete. 
There are approximate algorithms to address 
the problem and have inexact solutions[l, 10]. 
But generating a plan needs a temporally con- 
sistent model. Lack of this aspect was one of the 
weaknesses of the original work in this line by 
Allen et al[2j. In their scheme the arcs of the 
network will be left with consistent labelling, 
which is still a disjunctive set. Making a total 
order of the nodes is not feasible from such la- 
belling, which is demanded by a planning prob- 
lem. To manage the efficiency issue they have 
proposed a clustering approach. This would 
keep the network size under control so that run 
time of the algorithm is less affected by the in- 
crease in problem size. In this scheme all inter- 
vals should be clustered into a few groups based 
on some reference intervals, and propagation al- 
gorithm runs separately within each group. Al- 
though in some problem domains such hierar- 
chy of reference intervals may be inherent, in 
many domains, like planning, this approxima- 
tion may be impractical. 

III. Finding All Feasible Temporal 
Scenarios 

A global consistency algorithm finds out 
a globally consistent models or all consistent 
models as a side effect while trying to determine 
global consistency of a network. Any trivial 
backtrack algorithm can do this work in expo- 
nential time. It is important to devise heuristics 
to make it efficient. First heuristic based global 
consistency algorithm was proposed by Valdes- 
Perez[12]. There, the algorithm was not written 
well enough to be efficiently implemented. Re- 
cently Ladkin et al[6] and we [8] have come up 
with two different heuristic based algorithms for 
solving global consistency problem. 

Ladkin et al’s algorithm randomly picks up 
a singleton relation from an arc and runs an 


approximate algorithm to update relations on 
the other arcs while checking for consistency 
of the picked up primitive relation. If picked 
up relation is found to be inconsistent the al- 
gorithm backtracks over the set of disjunctive 
relations on the current and previously picked 
up arcs, otherwise it goes ahead with the next 
arc. The algorithm terminates if a singleton 
labelling is found for all arcs, generating a con- 
sistent model. It may also terminate by back- 
tracking up to the arc, which was picked up 
first, implying no consistent scenario is avail- 
able. Experimental results, presented by them, 
provides us with a confidence that the global 
consistency is a not a very difficult problem in 
average case, although its worst case behavior 
is NP-complete^. 

Table 1; Forward Pruning Algorithm 

For node number t=l to N do 

pickup an old singleton model of size (t — 1); 
for node number j=l to * — 1 do 

pickup singleton relation from label on 
arc < i, j > and update labels on aU arcs 
from < t, j + 1 > to < t,i — 1 > with respect to 
this singleton and arcs in old network; 
if updation fails on some arc 

if there is any more singleton left on this arc 
pickup next singleton and proceed 
otherwise backtrack to previous arc; 
if backtrack exhausts up to the first arc 
if a model was found 

save it in data base of partial models 
(of size t); 

otherwise return failure; 
if arc < t, i — 1 > is reached force backtrack; 

The algorithm proposed by us[8] is based on 
a pruning heuristic where inconsistent relations 
are eliminated systematically. We call it for- 
ward pruning algorithm, because it prunes la- 
bels on all forward arcs in a preassigned order. 
A lose version of the algorithm is given in Ta- 
ble 1. There is reason to believe that such sys- 
tematic working should improve the efficiency. 
Another feature in this algorithm is that the 
nodes are also systematically picked up one by 

very good example of such cases where a NP- 
complete problem is addressed comfortably in most in- 
stances, is the case of linear programming using simplex 
algorithm. 


one. At each stage of such addition of a node 
in the graph, a set of feasible models are gener- 
ated. Then the next node is attempted for addi- 
tion to each of these models. This model based 
approach makes it possible to generate all fea- 
sible consistent scenarios at a much lesser cost. 
In the worst case, the number of models itself 
may be exponentially growing with the num- 
ber of nodes. Some preliminary experimental 
results with this algorithm have shown that in 
reality this growth rate is not very high, and 
manageable. 

Efficiency of the forward pruning algorithm 
will also be aided with a fast preprocessing al- 
gorithm, which we have devised recently for in- 
cremental addition of nodes to temporal con- 
straint graph[9]. The incremental and model- 
based approach will also make a plan data base 
available, in which new operations can be added 
one by one. Then new plans can be generated 
with respect to these newly added operations, 
or any newly added information. Significance 
of having all feasible plans will be discussed in 
the next section. 

IV. Significance of Having All Feasible 
Plans 

Availability of all feasible plans allows one to 
have a choice of picking up the best plan accord- 
ing to some criteria. For example, in the previ- 
ously mentioned case of petroleum exploration, 
one can find out a plan whose execution will 
take lesser time than most other plans. This 
can be done by choosing a plan where meucimum 
number of ‘overlap’ relations occur (for paral- 
lely executing those operations)®. This proce- 
dure can also be automated by assigning some 
order of priority on the primitive relations on 
each of the labels on arcs. The resulting mod- 
els, or plans, will then also be ordered by the 
system accordingly. 

Normally a planning activity involves reach- 
ing a preassigned goal state. Most of the cur- 
rent planning systems are based on this objec- 
tive. A temporal reasoning-based planning sys- 

*Thus, a good pUin there, is SS after GS, SS over- 
lapping DA, DA overlapping PA and PA overlapping 
ER. 


tern, as described in this article, need not be 
goal driven only. Using a global consistency 
algorithm one can generate all possible scenar- 
ios, which allows one to do temporal projection. 
Then one can choose appropriate goal state and 
a corresponding plan, from those future scenar- 
ios. Thus, planning becomes a subproblem of 
temporal projection problem[7]. As far as we 
know the implication of this is yet to be stud- 
ied. 

In a dynamic situation where an agent is ex- 
ecuting a plan unknown by another agent, if all 
feasible plans for the first agent are available, 
then the second agent can recognize the first 
one’s currently executing plan by observing its 
operations executed so far. The plan recogni- 
tion here becomes a problem of matching ac- 
tions executed in past, or a partial plan, with 
the plans in the plan data base, and finding out 
the possible candidate plan(s), which the first 
agent might be executing currently. 

There is a related problem of answering 
queries about plans. In our example, there may 
be a question whether seismic survey can be 
consistently done along with enhanced recovery 
under any plan. If the answer is ‘yes’, then the 
question would be what is the complete plan 
(or plans) under which that can be done. In 
the formalism described here, such questions 
can be answered using same pattern matching 
technique described in last paragraph, and it 
can also be done in real time, because of the 
availability of all plans. This may be an impor- 
tant advantage for an autonomous agent in any 
realistic environment. Isolating all possible sce- 
narios also make it e 2 isy for updating plan data 
base with new information, which is a matter 
of adding or dropping consistent models. 

In a time limited application, a time limit 
can be imposed when plans are generated, to 
create as many consistent plans as possible, 
rather than generating all plans. This will, of 
course, reduce some of the advantages discussed 
above. However, under some real life circum- 
stances, that may be a better solution than to 
fail to generate any plan because sufficient time 
was not allocated. 


V. Conclusion 

In this article we have discussed the feasi- 
bility of planning using interval-based tempo- 
ral reasoning scheme. An advantage of using 
temporal representation is that of incorporating 
uncertainty of information about the temporal 
relation between different temporal entities in 
the domain. There can be many plans feasible 
under this circumstances. Additional advan- 
tage of interval based representation is of higher 
expressiveness, which will enable new types of 
planning not attempted before. The problem 
of interval based reasoning is that of its hard- 
ness. We have devised an efficient algorithm 
which can find out all feasible scenarios under 
incomplete information, in most of the circum- 
stances. The implication of having all feasible 
plans available is multifold. They are also dis- 
cussed here. So far nobody has taken any look 
into these aspects, although practicable algo- 
rithms for finding out consistent scenario is re- 
cently coming into fore. 
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Abstract 

Planning researchers are coming to accept that plan- 
ning is not sufficient for robotics. But many of them 
seem to think that the concrete levels of robot control 
which a planner must interface are somehow uninter- 
esting or unimportant to the development of a theory 
of autonomous control. On the other hand, some 
robotics researchers appear to think that planning is 
not even necessary for robotics, that reactive control 
will be adequate for any realistic robot system. Our 
thesis is that progress toward autonomous robotics 
will be stunted until both camps understand that the 
other plays an equally important role in the creation of 
non-trivial intelligent autonomous agents. This paper 
describes the role of planning and reactive control in 
an architecture for autonomous agents (robots). We 
posit this is necessary and sufficient. The key to our 
architecture is the interjection of a “sequencing layer” 
between the reactive controller needed for a robot to 
survive in a dynamic environment and a deliberative 
planner needed to develop a course of action to achieve 
high-level user goals, 

1 Past: Robotic control as a 
domain for planning 

Controlling an autonomous robot is mentioned in 
many planning research papers as a typical domain. 
But few planning researchers discuss, or even appear 
to understand the practical problems of robotic 
control. The following is the prototypical answer to 
complaints of the more practically minded robotics 
engineer: “I have captured the essence of the robot 
planning problem with a representative problem in 
which it is easier to present a discussion of all the 
important issues. The details are unimportant.” 

That justification would be acceptable if there were 
evidence that the details are really unimportant. But 
we know from the recent spate of papers on the 
complexity of planning that making planning practical 

Copyright © 1993 American Institute of Aeronautics and 
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are in the details. Abstractions into a “representative 
problem” obscures the fact that classical AI planning 
can only address one segment of the robot control 
problem. 

Robotic control from the planning researcher’s 
perspective is often viewed as the task of moving from 
one location to another [20, 16, 3, 15, 24, 8, 14]. The 
desired plan would be a sequence of movements for 
the robot to carry out. Such movement plans usually 
assume sensing systems capable of generating accurate 
metric models of the portion of the world relevant 
to the navigation task. When people got around to 
building robots that could benefit from this type of 
planning, they found that nearly all of the interesting 
behavior could be accomplished more efficiently by the 
mechanisms assumed by the planner. 

The rasion d’etre of deliberative planning is the 
need to reason about preconditions. It is not too 
difficult to construct practical examples of why this 
is necessary for an autonomous agent. If you have 
more in mind for the robot than getting from here 
to there, you can usually convince yourself that 
you need to think about it before “heading in the 
right direction” and relying on reactivity. Dealing 
with complications like another agent loose in the 
environment or the need to do something like repair 
a broken device (so you need to gather the right tools 
before you leave) are not activities a purely reactive 
system can accomplish any more efficiently than a 
classical planner can accomplish movement planning 
in unstructured environments. 

Chapman proved that planning is computationally 
intractable and could not possibly be a sufficient the- 
ory of intelligent behavior in a real-time environment 
[5]. Others have catalogued some of the behaviors 
one would need in an autonomous agent: reaction 
to unforeseen events, iterative actions (e.g., traveling 
down a street stopping for all the red lights), real 
time projection and conditional commitment to action 
(e.g., cross a busy highway [19]). 
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Chapman was the vanguard of the '‘situated rea- 
soning” approach to intelligent agents [4, 1, 6, 17]. 
These “reactionaries” started at the opposite end of 
the control continuum from classical AI planning, 
bent on showing that planning may not even be 
necessary for intelligent behavior. But the problems 
the reactionaries cannot address turned out to be 
important to producing an autonomous agent. Among 
the more obvious are the inability to employ predictive 
reasoning about preconditions (check the gas gauge 
before embarking on a long car trip, don’t wait 
until the car starts sputtering), coordinating activity 
with other intelligent agents (pick up a heavy object 
together), reason under uncertainty and take risk- 
alleviating actions (move into the right lane well before 
your exit). 

Our hypothesis is that planning is necessary for 
control of autonomous robots, just as situated reason- 
ing or reactive behavior is necessary. The main issues 
are mediating between deliberation and reaction to 
produce seamless intelligent behavior, and determin- 
ing the proper roles of the various components of an 
intelligent agent architecture. 

2 A three-layer architecture 
for intelligent agents 

The robot intelligence community has begun to agree 
on a software architecture for “intelligent” robotic 
systems [10, 18, 12, 7, 11, 23, 25], The consensus 
emerging is an architecture which incorporates both 
planning and reaction. In most of the architectures 
cited, there is a planning component for reasoning 
about the overall mission and generating contingencies 
when the mission itself is in danger of failing. 
Importantly, the planner is asynchronous (but on- 
line) with a real time reaction component which can 
achieve the situated reasoning needed for survival and 
continuous control. 

While it now seems obvious there is a role for 
reaction and planning in robot control, what is not 
so obvious is how to mediate between the two. Our 
architecture separates the general robot intelligence 
problem into three interacting pieces, with the middle 
piece being the key to mediation between reaction and 
deliberation (see Figure 1): 

1. A set of robotic specific reactive skills. For 
example, grasping, object tracking, and local 
navigation. These are tightly bound to the 
specific hardware of the robot and must interact 
with the world in real-time. 

2. A sequencing capability which can differentially 



Figure 1: Generic intelligent control architecture 


activate the reactive skills in order to direct 
changes in the state of the world and accomplish 
specific tasks. For example, exiting a room might 
be orchestrated through the use of reactive skills 
for door tracking, local navigation, grasping, and 
pulling. 

3. A deliberative planning capability to reason in 
depth about goals, preconditions, resources, and 
timing constraints. The planner generates rough 
plans for accomplishing goals. For example, given 
the task to retrieve an item and a map of a 
building, the deliberative system could reason 
about the interconnection of spaces and return 
a plan for the robot to exit the room, follow the 
hall to the left and enter the third door on the 
right. 

This paper focuses on the interaction of planning 
and sequencing layers of this architecture. Interaction 
of the sequencing system with the reactive layer of the 
architecture is covered in other papers [23, 25]. 

3 Sequencing: caching 
techniques for handling 
routine activities 

Reactive control addresses only the most obvious 
defect of state-based planning for robots, the inability 
to represent and reason about motor control in real 
time. But even if provided with primitives such as 
grasp, track- wall, and so on, a state-based planner will 
still be unable to efficiently handle everything needed 
for robot control above the level of reactions that can 
be compiled into guaranteed-reaction-time primitives. 
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The most obvious problem is the need to do indefinite 
iteration of sequences of primitive behaviors: 

Do unitl (at robot 1 doorB) : 
Put-one-foot-in-f ront-of-the“Other 

The problem with such sequences for a state-based 
planner is they produce an indeterminate number 
of states to manage. A plan is basically a proof 
that some goal can be achieved with a sequence 
of state transitions effected by atomic plan steps. 
Planners convince themselves a goal is achievable 
by constructing the whole plan. Obviously, it is 
difficult to generate a complete state- transit ions plan 
to embody what we easily expressed by the iterative 
construct above. That is the reason typical planning 
representations ground out on primitives like the 
following: 

(Operator move 

: purpose (location ?robot ?destination) 

: preconditions ((clearpath ?robot ?destination) 

...)) 

Such operators assume things which are hard to 
encode as states, like keeping away from walls and 
people. They also assume indefinite sequences of 
very small grain actions. Situated actions typically 
implemented in robots often match the state transition 
view of classical planning perfectly well [13]. The 
problem is two-fold. One is performance; at the 
level of abstraction provided by situated actions there 
would be too many plan steps to generate for even very 
simple goals like moving from within a room out into 
a hallway. The second is representation. No one has 
developed a useful state-based planner that can reason 
about indefinite iteration of actions and conditional 
actions. 

What is needed is a layer between reactive be- 
haviors and deliberative planning which allows the 
planner to reason only to the level of routine activities 
such as move and open-door (grasp, turn, and pull. If 
that fails, push. If that fails consider another route 
to the goal). In our architecture, we use Reactive 
Action Packets (RAPs) to encode routine behavior as 
a sequence of situated skills [22] . RAPs is a language 
with a syntax similar to the syntax of classical 
planning systems [10]. Like most planning systems, 
the RAP system uses a library of decomposition rules 
to represent sequences of behaviors to accomplish 
a task. The system can quickly transform a ta;Sk 
into a context specific sequence of primitive actions 
by caching solutions to common tasks. Unlike a 
planning system’s computationally expensive search 
mechanisms used to decompose tasks into primitives. 


the RAP system must have a solution to the given task 
cached in its library or the system reports a failure. 
RAPs can encode conditional and iterative sequences 
of actions since there is no state-based search involved. 
As is exemplified by the following example, the door is 
iteratively bashed with a sledge hammer until the not 
closed state is detected or simply opened if the door 
is unlocked. 

(def ine-rap opendoor 

(success (not (closed ?currentdoor) ) ) 

(method 

(context (doorlocked ?currentdoor) ) 

(tl (grasp~s ledge-hammer) (for t2)) 

(t2 (pound-door ?currentdoor) 

(wait-for (not (closed ?currentdoor) ) ) ) ) 

(method 

(context (not (doorlocked ?currentdoor) ) ) 

(tl (grasp-door-handle) (for t2)) 

(t2 (turn-knob) (for t3)) 

(t3 (pull-open-door) ) ) ) 

While the RAP system can perform task decom- 
position, it is not suited for direct interaction with 
the world. The software constructs that are used 
for selecting action routines, binding variables, and 
so on make the system too slow for survival. Thus 
the system is used in our architecture to dynamically 
configure a reactive layer to handle the interaction 
with the world for the current task and situation. This 
allows complex behaviors to be programmed, while 
relying on always-active situated skills to protect the 
robot from inaction in a rapidly changing environmen- 
t. 

Sequencing, married to reaction, yields significantly 
better task coverage than either of the two can 
provide alone. Still, the combination of the sequencing 
and reactive layers is not structured to perform 
complicated resource allocation reasoning. Such is 
typical in determining the best way to carry out a set 
of tasks. Nor are these two layers good at reasoning 
about the failure requirements or consequences of a 
task. So where the sequencer gains in its ability to 
handle routine situations (e.g., starting a car, opening 
and moving through a door), it lacks the ability to 
string these routine tasks together in a way that will 
have the desired “global” behavior. But that happens 
to be just the thing planners are good for. 

4 Planning 

Our view is that there is a role for state-based planning 
in robotic intelligence, but it should be limited to 
tasks that are not easy to specify as sequences of 
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common robotic skills. When planning is necessary, 
the planner should think of the problem at the highest 
level possible in order to make the problem space the 
smallest possible. 

Thus, the role of planning is to deliberate, but only 
when necessary. The role of reaction is to control 
real-time behavior. The role of sequencing is to 
raise the level of abstraction of the lowest level of 
activities which the planner will concern itself. In the 
process, eliminate the need for state- based planning 
of things which are easy to encapsulate in an operator 
that grounds in an indefinite iteration of low-level 
skills. Importantly, all three layers must operate 
concurrently and asynchronously. Accomplishing this 
is the key to making planning useful in a robot. 

Because the sequencer has a cached solution to 
routine tasks, the planning system has the advantage 
of building upon this level of abstraction providing 
it larger grain sized primitives. This eases the 
complexity of the “planning problem” because it 
eliminates large numbers of essentially linear planning 
problems. Ability of the RAP system to deal with 
iterative behavior greatly simplifies the planner’s rep- 
resentation, allowing the simple state representation 
common to classical planning to suffice in most 
common situations. 

The planner we are using in our experiments is 
called AP [9]. AP has a number of features which 
make its role more compelling than robot planning 
typically exemplified in the planning literature. 

One thing missed by both the planning and robot 
control communities (while they were arguing over 
the necessity of planning) was autonomous robots 
generally will be autonomous only from the human 
giving them orders. They will not often be acting 
alone in their environment. Multiagent control is 
necessary when more than one robot is employed 
to carry out tasks, or when multiple robots are 
operating independently on multiple tasks in a shared 
environment. 

AP was designed to deal with multiagent coordi- 
nation. To do this, it extends state-based planning 
to reason about the conditions that hold during 
actions. This capability allows AP to plan activities 
such as two robots carrying a bulky object. The 
following operator is an example from a test domain. 
Note the planner can instantiate the variables ?arm- 
or-robotl and ?arm-or-robot2 with anything that 
meets the constraints. A two-armed robot or two 
single armed robots might be used. The temporal 
relation “simultaneous” imposes a non-codesignation 
constraint on the agents so that a very strong one- 
armed robot would not qualify. Other temporal 


constraints in the plot language would allow codes- 
ignation). These plot temporal constraints also cause 
the plans steps that instantiate the plot subgoals to 
include scheduling information that the sequencing 
layer and AP’s execution monitor can use. 


(Operator pickup-heavy-object 
: purpose (holding Tplanner ?large-thing) 

: arguments 

( (?weight -of -thing 

(get-value ?large-thing ^weight))) 

: preconditions 

((top ?large-thing clear) 

(on ?large-thing ?something)) 

: constraints 

((can-lift ?arm-or-robotl 

(*0.5 ?weight-of -thing)) 

(can-lift ?arm-or-robot2 

(♦ 0.5 ?weight-of -thing)) 

:plot 

(simultaneous 

(grip ?arm-or-robotl ?large-thing) 

(grip ?arm-or-robot2 ?large-thing) ) 

: effects 

((holding ?planner Tleirge-thing) 

(top ?something clear) 

(on ?large-thing nothing))) 

Another feature of AP which makes it appropriate 
for control of autonomous agents is that it can 
reason about uncontrolled agents. AP was originally 
developed to address multiagent adversarial domains 
(AP stands for Adversarial Planner). Of course 
most robot applications are not adversarial. An 
uncontrolled agent might be a human operating in the 
environment along with a robot, or even nature, AP 
can use its adversarial reasoning capabilities as a risk 
assessment mechanism to decrease the probability of 
dangerous interactions with other agents. 

AP includes a “counterplanning” component to 
reason about how an uncontrolled agent might prevent 
a plan from succeeding, either by negating a precondi- 
tion or a “during condition.” Problems are uncovered 
and addressed by augmenting the plan with operations 
which prevent the negative effects of the uncontrolled 
action. This amounts to reasoning about situation- 
specific preconditions, and is the way AP addressees 
the “qualification problem” [21]. 

A typical example of this type of reasoning in a 
robot application might go as follows. The robot is 
assigned the task of repairing a device. In the course 
of the planning process it might post a “protection 
interval” on the condition that a door remain open 
for the duration of some operation. Counterplanning 
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might discover that a human could close the door, and 
a way to prevent this could be to post a notice that 
the door must be open until further notice. 

AP has other features which are needed to fully ad- 
dress the requirements of an intelligent agent. These 
include: reasoning about metric time (scheduling), 
execution monitoring, and replanning. Execution 
monitoring allows the agent using AP to recognize 
and skip plan steps that are overcome by events, and 
to replan when something unexpected occurs (e.g., a 
door that should have been left open is closed and 
locked). Execution monitoring also projects ground 
truth observations through the state space generated 
during planning. When ever there is a change in a 
output situation proposition, AP’s execution monitor 
rechecks preconditions and recalculates certain “re- 
computable effects” of subsequent plan steps. This 
permits AP to predict failures, rather than waiting 
to notice them, as would happen if the agent relied 
only on situated reasoning and sequencing. This 
is obviously something the planner should be doing 
outside the sense-act cycle of the plan executor. 
In fact, AP was designed from the beginning to 
be a deliberative process aloof from the execution 
environment. 

Replanning in AP is cued by the execution monitor 
when it notices or predicts failures. Replanning in 
AP is based on the concept of a “minimal repair 
wedge.” AP assumes the majority of a plan is 
salvageable. The idea is to excise the minimum 
number of plan steps dependent on the failed step 
and replace them with a “wedge” of operations that 
achieve the original subgoal with alternate means. 
This strategy is both more computationally efficient 
and cognitively plausible than planning again from 
scratch, which is what most classical planners are 
doomed to do. 

5 Implementation Status 

We have completed implementation of an interface 
between a RAP- based sequencing system and a facility 
for providing a set of situated skills which the 
sequencer can manipulate to cause activity in the 
world [22, 25]. We are now combining AP with the 
RAP-based sequencer. 

Shortly we expect to begin testing our architecture 
on a number of problems. Questions we plan to 
address include the following: 

1, What activity is appropriate for planning or 

sequencing layers? 

2. What domains where reactivity is sufficient? 


3. Are there domains where sequencing is sufficient? 

4. In which domains is deliberation highly useful? 

5. To what measure is the architecture beneficial 
over more ad-hoc approaches? 

6 Future Work 

After we complete testing our architecture, we plan to 
explore the architecture as a basis to incorporate in 
robots certain cognitive capabilities normally associ- 
ated with intelligent behavior. The first area we will 
investigate is learning. 

We are guided in our ideas about learning by An- 
derson’s ACT* model of cognition [2]. In ACT*, one 
of the important uses of learning is for performance 
improvement. It is hypothesized that expertise is 
gained by compiling common sequences of primitive 
actions into routines which henceforth take the agent 
essentially no time to derive. This type of learning 
can be easy in our implementation because the RAP 
library can be dynamically modified. For example, 
the robot could “learn” the commonly used plans for 
getting places from its nominal home (e.g., to the mess 
hall, the latrine). Since AP’s plans are parameterized 
and have syntax similar to RAPs, the system could 
compress common sequences of operators into RAPs 
and then add that RAPs as a planning primitive. 
Thus, the next time the robot has a goal to eat dinner 
it would not have to invoke the planning system to 
accomplish the task of getting to the proper location. 
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Abstract— A robotic planning and control system 
based on the subsumption architecture is de- 
scribed. The subsumption planner extends the 
purely reactive subsumption architecture by ex- 
tending the sensor space (from which the behavior 
modules are triggered) into a virtual future^ and 
augmenting the behavior module with a cause-ef- 
fect predictor triggered by the same sensory situa- 
tion as the reactor. Virtual sensor space is used by 
the planner as a scratchpad to visualize alternate 
plans. The predictor contains a partial world 
model relevant to its particular behavioral 
expertise. The collective network of predictors op- 
erates in parallel with the reactive network form- 
ing a recurrent network which generates plans as 
a hierarchy. Details of a plan segment are gener^ 
ated only when its execution is imminent according 
to the principle of least commitment. An imple- 
mentation of subsumption using object oriented 
design is proposed. The behavior of the subsump- 
tion planner is demonstrated in a simple maze 
navigation example. The subsumption planner is 
expected to improve the robot* s performance by 
reducing feedback delays and unnecessary de- 
tours. It provides a framework for general behav- 
ioral planning in real-world robots of the sub- 
sumption style. 

1t introtitictlQn 

Reactive robotic control systems, such as the sub- 
sumption architecture, have enjoyed popularity 
among the research community for the last few 
years. Robots built according to these principles 
are very successful at performing tasks in unstruc- 
tured, real-wwld environments. Howev^, reactive 
systems tend to behave in a pre-programmed, rote 
manner. Selecting altonate courses of action based 
on previous experience or reasoning (i.e., delibera- 
tive behavior) must be designed into the behavior 
itself rather than being an emergent prc^rty of the 
netwwk inteiconnectivity. Rec^t work has recog- 
nized a need to incorporate deliberative planning 
capabilities in leal-world systems. 

Planning is essentially the ability to look ahead 
and predict outcomes of actions (or inactions), and 
to m^e decisions based on that knowledge. A plan 


is a sequence of decisions to be made in the future. 
These decisions assume a particular expected se- 
quence of states. Anticipation of future states gives 
the planning robot advantages in efflciency (via 
the ^ility to avoid known dead ends), in flexibility 
(replanning on the fly), and in reaction speed 
(eliminating most of the delay inherent to reactive 
feedback-only systems). Autonomy in the real 
world demand predictive and deliberative behav- 
ior in addition to a reactive component 

The subsumption planner uses a parallel dis- 
tributed computational paradigm based on the sub- 
sumption architecture for the control of real-world- 
capable robots. This approach is derived from a 
number of various disciplines including ethology, 
the theory of animats, and computational neuro- 
science. Plans are represented as trajectories in a 
time-extended virtu^ sensor state space. States 
along this trajectory represent the future as the 
robot expects it to appear, in terms of its own 
senses. Virtual sensor state s[»ce is used as a plan- 
ning tool to visualize the robot’s anticipated effect 
on its environment 

Decision sequences are generated by the planner 
based on the environmental situation expected at 
the time the robot must commit to the decision by 
acting on it Between these decision points, the 
robot performs in a pre-programmed mann^, lim- 
iting its reactions to avoiding obstacles, stalls, and 
danger. A rudimentary, domain-specific partial 
world model contains enough information to ex- 
trapolate the end results of rote behavior between 
decision points, and thus to predict the sensed situ- 
ation at the next decision pdnt 

By constructing plans with little detail as long as 
they are far in the future and filling in details only 
as Aeir execution becomes imminent (the princqile 
of least commitment), failed plans can be dis- 
carded without much resource cost. Rough plans 
are represented as indefinite trajectories between a 
few well-defined waypoints in the state space. 
These waypoints are the expectations graerat^ by 
the predictors. Initially, they are spread far apart in 
time — the plan is coarsely resolved. Rough plan- 
ning consumes few resources. As the first plan seg- 
ment comes closer to fruition, details are added 
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just in time for execution. These multi-resolution 
plans are built from the outside in. 

Planners need a world model to evaluate alterna- 
tives. One objective of the subsumption planner is 
to distribute the world model in a plausible way 
across the entire decision-making system. Each be- 
havior module contains a discrete piece of exper- 
tise. These expert agents contain not only actimi 
generatitHi routines, but also cause-effect predic- 
tors. The planner is implemented as a network 
structure which parallels the behavior module net- 
work of the subsumption architecture. 

Implementation of subsumption as an object-ori- 
ented design simplifies the implementation of a 
software simulatirxi, as well as providing organiza- 
tional and devel(q>mental flexibility. The subsump- 
tion network structure lends itself to the use of dis- 
tributed hierarchical encapsulating objects. In ad- 
dition, the hierarchy enables prioritization of active 
behaviors to occur before the generation of actual 
motor outputs. This simplifies the prediction 
mechanism. 

2. Related Work 

As the shortcomings of monolithic, centralized 
robotic controllers and planners became obvious, 
new distributed architectures were proposed. These 
were rooted in a variety of different nontradidonal 
disciplines, including ethology, biology, neuro- 
science, physics, psychology, and sociology. A 
strongly ethological and intuitively elegant ap- 
proach is espoused by Albus^ . Marvin Minsky, a 
pioneer in artificial intelligence, theorizes about 
the nature of information processing in the human 
mind, speculating that thought is composed of the 
collective actions of a society of individual, sub- 
intelligent agents acting cooperatively^. The new 
architectures share many features with object-ori- 
ented paradigms. Schema theory^, in particular, 
models intelligent systems firom Ae p^pective of 
brain thewy as hierarchical networks of nested ob- 
ject-like schemas. Schema theory and object-ori- 
ented design are both rooted firmly in distributed 
artificial intelligence. 

Sensor and motor information (indeed, any type of 
information at all) can be described in terms of dy- 
namic high- or infinite-dimensional state spaces^. 
A trajectory through this state space represents the 
time evolution of information or concepts. This 
genetic framewoik has been developed into a de- 
scription of physical and mental behavior, which 
can describe many diverse information processing 
techniques and knowledge representations. 


The subsumption architecture developed by 
Brooks at MIT*- introduced a new pwspective 
on building robots for the real-world, often called 
creatures or animats. Coimell’ further develops the 
subsumption technique, describing an autonomous 
robot whose job it is to find and collect soda cans 
in a lab and deposit them in a receptacle. This 
robot has served as a testbed for an in-depth study 
of subsumption and its numerous advantages ova 
earlio' meAods for controlling real-world robots’. 
The major achievement of the subsumption archi- 
tecture is to demonstrate how to combine many 
isolated pieces of robot control into a single, 
working real-world machine opoating under a sin- 
gle, cohoent, and consistent architecture. 

Earlier robots were effective only in toy worlds: 
well-defined environments, such as in manufiictur- 
ing, whoe repetitive movements could be guaran- 
teed to be successful. These applications do not re- 
quire much sensory ability. The robots built by 
Brooks react in retd-time to changes in the envi- 
ronment. They rely on a rich suite of sensors to 
provide information about the environment on 
which to act The actions themselves are generated 
by a network of behavioral modules whose outputs 
are mediated by a hardwired arbitration network.of 
gating nodes. This network gives certain higher- 
level behavioral modules responsibility for, and 
control over, the outputs from lower-level mod- 
ules. 

In addition to performance advantages due to di- 
rect implementation on parallel hardware, the sub- 
sumption architecture provides all the benefits of a 
distributed system, including scalability, graceful 
degradation, robusmess, simpler elements, and bi- 
ological plausibility. Much of the intelligence of 
the syst^ emerges from, and is embodied in, the 
network connectivity. 

The lack of an explicit world model was originally 
seen as an advantage of subsumption. Using the 
real world as its own model solved the problems of 
keeping the model up to date and deciding what 
was important to include, as well as what form of 
representation to use. However, without a world 
model, a system or organism cannot anticipate the 
effect of an action until that effect is actually 
sensed (giving rise to feedback delays). This can 
lead to poor reaction time and potentially life- 
threatening slowness. 

The purpose of a world model is twofold: It keeps 
track of the state of the world, and it describes 
cause-effect relationships due to actions (i.e., mov- 
ing an object) or inactions (allowing an object to 
fall). These are data structures and processes re- 


q)ectively. The cause-effect relations may be due 
to laws of physics, to the robot’s own actions, or to 
the actions of another agent, in order of increasing 
difficulty of prediction. In the subsumption archi- 
tecture, these cause-effect relations are implicitly 
designed into the behavior modules. For example, 
t^mination conditions assume a cause-effect rela- 
tionship due to physical laws. The result of ac- 
tively moving an object from x to y is that the ob- 
ject eventually exists at y. The fact that the object 
is at y can either be represented internally in a 
world model, or, in the subsumption model, sensed 
again only when necessary. 

Recently, sevoal modifications and extensions to 
the subsumption architecture have been proposed. 
These address limitations or inefficiencies of sub- 
sumption such as the lack of a world model, or its 
lack of learning ability. 

Mataric’° extends subsumption by implementing 
knowledge representation as a map (a kind of 
world model) for navigation by integrating an in- 
ternal landm^k-based map representation built as 
behavior modules. Behavior modules are used here 
to represent as well as act. In this case, each 
module represents a landmark. The building blocks 
of the subsumption architecture remain fundamra- 
tally the same; Mataric describes a new usage of 
them. 

Dorigo and Schnepf" propose a learning technique 
for behavior-based robots in which new behaviors 
are created and added to the robot’s repertoire us- 
ing a genetic algorithm. The behavior-based archi- 
tecture is based on ethological theories developed 
ovCT the last eighty years'^. They note that belmv- 
iors have been constructed which are well tailored 
to specific tasks, but no conceptual model exists 
for relating behaviors to each other. Learning is 
achieved by evaluating new behaviors and includ- 
ing them if they perform well. The resulting sys- 
tem is a synthesis of behaviwal and genetics-ba;^ 
paradigms for intelligent real-world behavior. 

Learning by progressive modification of a reper- 
toire of reactive behaviors is described by Lyons^’, 
using a process-algebra language. The concq>t of a 
planner as a separate subsystem which interacts 
with a reactor is presented. The plan is contained 
in the reactor as a set of processes. The reactor 
adds and deletes processes, tuning the reactor to 
perform the task more robustly over time. The 
planner has a repertoire of plan elements and the 
knowledge of how to fix shortcomings in the 
reactors overall behavior. It provides the system 
with a global perspective on task execution which 
is lacking in the purely reactive system. 


Traditional planning systems constructed plans 
off-line, often in a separate centralized planning 
subsystem. Planners were seen as logical engines 
which generated a complete, detailed plan as a re- 
sult of a search through a tree of possible action se- 
quences*'*’ These plans worked best in a toy 
wwld. Distributed planning systems were eventu- 
ally developed as a consequence of the focus on 
distributed artificial intelligence in the late 1980’s. 
One of these was the Distributed Vehicle 
Monitoring Testbed**- *’• ** which distributed par- 
tial plans among independent agents. In this 
scheme, plans were subdivided spatially. 

In the past, planning has been studied in isolation. 
It is now brooming widely accepted that the pre- 
dictive, deliberative planning component is neces- 
sary to complement the reactive behavior-based 
system. These two systems must be highly inte- 
grated, yet distributed across the network. 
Planning, in a sense, rq>resents the antithesis of re- 
ality-bas^ reactive systems. A planner must imag- 
ine nonexistent realities and visualize and evaluate 
alternative future actions in that virtual reality. 

3. Architecture 

Reactive robot controllers implement a feedback 
loop for all activity (Fig. la). Feed/brward loops 
(Fig. lb) are more responsive and exhibit 
smoother, more desirable control, provided the 
model is accurate. The feedforward control tech- 
nique has been used to mimic the smooth, pre- 
cisely controlled behavior of the human arm by a 
robot arm using highly non-linear air-blad^r 
“muscles” *’• ^*. The feedforward loop relies on 

an internal model of physical cause and effect to 
predict and generate appropriate actiems in real 
time, before their effect can be sensed. The 
Kawato-Katayama arm trajectory generators, im- 
plemented as neural networks, act as continuous- 
time predictive models the physics of arm behav- 
ior. 



Fig. 1a. Feedback only reactive system 



Fig. 1b. Feedforward reactive/predictive system 

The subsumption planner adds to traditional sub- 
sumption the ability to predict the effect of robot 
actions. The action-geno-ation expertise is collo- 
cated with the knowledge of cause-effect relation- 
ships in the behavior module associated with that 
domain. This cause-effect knowledge is encapsu- 
lated in a predictor which anticipates the expected 
state resulting from performing the behavior. The 
expectations are represented as state vectors in the 
virtual sensor state space representing future 
states — the collection of these states together de- 
fine a trajectory representing a future course of ac- 
tion. 

Subsumption networks are often seen in which in- 
formation flows in one direction from input to 
output There are no recurrent links in this type of 
network. The lack of recurrent links allows only 
simple reactive behavior to be produced, preclud- 
ing complex dynamic behavior. In the same way, 
non-recunent neural networks such as the back 
propagation network do not contain any recurrent 
information flow. Non-recurrent networks are not 
dynamical systems, and therefore they carmot ex- 
hibit chaotic or even oscillatory time-varying be- 
havior. This severely limits the behavioral com- 
plexity and sophistication of these networks 
(although they are more easily understood). 
Similarly, in the subsumption architecture, any 
complex processing occurs within the behavior 
modules, rather than between them. (Some special- 
ized subsumption networks have l^n designed 
with recurrent links; those for the control of walk- 
ing, for example.) 

The subsumption planner allows for three general 

forms of information flow. The sensor subsystem 


contains an afferent abstracting flow, and the mo- 
tor subsystem contains a de-abstracdng efferent 
flow. The subsystem between these two, cominis- 
ing the decision making mechanism, is constructed 
as a network which allows a great number of in- 
formation flows in both forward and backward di- 
rections. The forward flows are essentially identi- 
cal to those of the ctmventional subsumption archi- 
tecture and represent reactive behavior such as 
tracking or avoidance. The backward flows create 
loops in the information flow; these recurrent 
flows generate the visualization of future actions 
and planning. These recurrent flows are model 
geno-ated predictions which are fed forward (i.e., 
in the same direction as information in the real 
world — from effectors to sensors) toward the sen- 
sory subsystem and treated as imaginary, virtual 
sensory situations. The key point is that the inter- 
nal modeled information flow mimics the outside 
world’s physical cause-effect information flow. 
Multiple iterations of this loop propagate predic- 
tions farther into the future, although any modeling 
errors will accumulate. These sensory situations 
are ordered in time and represent plans for future 
behaviOT. 

In the badidonal subsumption architecture, the re- 
sponsibility for reacting to the environment is de- 
composed and delegated to a number of reactive 
bdiavior modules. A behavior module can be seen 
as an agent which is an expert in one discrete do- 
main of behavior. These modules in turn may have 
hardwired managerial control over an entire net- 
work of subordinate behavior modules. The con- 
trol is manifested as subsumption, whereby the 
manager commandeers control over the robot 
whenever it wants. The manage' handles special 
situations only; the common situations are ignored 
by the manage- and handled by its subordinates. If 
the speciflc triggering situation for the manager 
does not exist, some subordinate, more generalized 
behavior probably has been triggered by a less 
specific sensory situation. At the lowest level, a 
behavior may be continuously triggered by default; 
in effect, it is triggoed by any sensory situation. 

4. Extensions to Subsumption 

In order to implement the subsumption planner, 
four extensions to the traditional subsumption ar- 
chitecture are needed: 

• Sensor and motor space abstraction 

• Virtual future sensor space 

• Cause-effect predictors: augmentation of 
behavior modules 









• Hierarchical CH’ganization: object-oriented 
implementation of arbitration 

Sensor and motor space abstractions extend the 
real-world interface (inpul/output) information 
spaces, allowing for more sophisticated, abstract 
triggering and action-generating mechanisms for 
the behavior modules. Virtual future sensor space 
provides a scratchpad for construction of plans. 
Cause-effect predictors are associated with the be- 
havior modules and generate the plans themselves 
as trajectories in the virtual sensor space. The hier- 
archical organization is an alternate object-oriented 
implementation of the subsumption architecture. 

4.1 Sensor and Motor Space Abstraction 

Sensor space is a very-high-dimensionality repre- 
sentation of the information entering the robot 
from its external sensors. After processing and 
combining raw sensor data, abstract sensor data 
containing composite information in a more easUy 
assimilated form, or a form with higher informa- 
tion density, can be created. These abstractions 
have been called “logical sensors”^^. They are 
generated through the fusion of raw sensor data. 
Examples in robot vision include edge detectors, 
novelty detectors, motion detectors, or highly ab- 
stract sensors such as face recognizers. Cross- 
modal abstractions sense location or orientation 
using a number of sensor sources to reduce uncer- 
tainty. A logical sensor might determine location 
by combining information gleaned from a numb^ 
of raw sensors, such as sonar, visual clues, radio 
landmarks, etc. 

In the original subsumption architecture, behaviors 
are triggered directly from raw sensor inputs. The 
entire extended sensor space including raw sensor 
data and abstract, derived sensor data, is available 
to trigger the subsumption planner’s behavior 
modules. Any particular behavior module is sensi- 
tive only to a small localized subset of this sensor 
space, similar in some measure, for example a 
small visual patch. The sensitivity of a behavior 
module is also localized in level of abstraction — a 
simple, low-level behavior may trigger on contact 
with a single touch sensor; a more abstract, higher- 
level managerial behavior on recognition of a 
complex object such as a familiar face. 

The abstract sensors are derived from the raw sen- 
sors; the raw sensor space by itself is complete. 
The abstract sensor space extension is a conceptual 
device to provide behavior modules with a com- 
mon pool of sensed information from which to 
trigger. A behavior module, rather than triggoing 
on a complex pattern which could potentially be 


multi-sourced and multi-modal (as well as uncer- 
tain and noisy) in raw sensory space, may trigger 
on a simple abstract pattern, or even a single 
highly processed attribute in abstract space. 

These high dimensional spaces are conceptual 
constructs. Obviously, to implement them faith- 
fully following the theory is extremely wasteful 
and probably impossible. The implementation can 
be drastically optimized while still following the- 
ory. For example, to conserve resources, the ab- 
straction of raw sensors and data fusion should 
only be performed by request 

Complex motor actions generated by the motor 
subsystem are also abstracted into motor pattern 
generators (called “central pattern generators” by 
neuroscientists). Locomotion is a repetitive ab- 
stract motor pattern which is compris^ of a num- 
ber of individual motor actions. The relatively 
complex repetitive pattern of actions are generated 
by a single manager behavior which directs sim- 
pler, lower-level behaviors to generate the rudi- 
mentary component actims. Variaticms of locomo- 
tion, such as speed, gait, direction, can be siqtplied 
to the abstract locomotion behavior as parameters. 

4.2 Virtual Fu tura Sensor Space 

Sensor space, extended in dimensionality by the 
incorporation of abstract sensor dimensions, is 
also extended orthogonally to represent a time di- 
mension — the extension of the sensed environment 
into the future. This extension will be used to pro- 
vide a working scratchpad for plan genoation. 

This extension provides a conceptual framework 
for the constructicxi of plans. A plan is a visualiza- 
tion of a sequence of events. The most complete, 
natural, and effrcient way (fw that matter, the only 
possible way) to represent a visualization is in 
terms of the same modality in which it will actu- 
ally be sensed. Since the extended sensor space 
contains all possible sensory situations, the visual- 
ization of “how it will ^pear to the senses” is also 
contained. The difference between real-time sen- 
sory situations and virtual sensory situations is in 
completeness (everything need not be repre- 
sented), resolution (irrelevant details may be 
omitted), and uncertainty (multiple possible values 
or fuzziness). 

The plans generated by the predictors are repre- 
sent^ as a sequence of nodes or waypoints, rela- 
tively firmly located in state space, at distinct fu- 
ture times, connected by indefinite links. As the 
plans mature or become more imminent (the plans 
are also refined by the predictors), they become 


more flim in location, and intennediate waypoints 
emerge. 

Multiple alternative plans are tagged with plan 
quality measures. The plans are evaluated by the 
predictor. When a choice of plan is necessary (i.e., 
when a complete replan is necessary, or upon em- 
baridng on a new t^), the highest quality plan is 
selected. The robot will thra begin executing this 
plan. 

4.3 Cause-Effect Predictors 

Predictors are agents associated with behavior 
modules. The original subsumption style action- 
genoating bdiavior module, which we now call a 
reactor, is triggered by a sensory situation in the 
current sensor subspace. By augmenting the behav- 
ior module with a predictive mechanism, called the 
predictor, the effect of the reactor on the environ- 
ment can be predicted. The predictor is triggered 
by the same sensory situation as the reactor in the 
current or any future sensory subspace. The predic- 
tor then generates a trajectory from the trigger 
point to the future point representing the predicted 
resultant effect of performing the behavior. 

Predictors are not required for low-level, reflexive, 
or protective behaviors such as obstacle avoidance 
or tracking. These behaviors avoid anomalous 
states, such as being stuck in a ccxn»; they guaran- 
tee that the robot remains in a nominal space. 
Higher level behaviors can expect that the anoma- 
lous states will be avoided. The lowest level be- 
haviors are purely reactive. This reliance on low- 
level behavioral guarantees make the prediction 
job of the higher level behaviors easier. In general, 
predictors are necessary only for manager behav- 
iors. 

The predictor is ins^sitive to its trigger source — 
whether it is actually being sensed at the present 
moment, tv if the sensory situation is an imaginary 
construction of some previously triggered predic- 
tor. The idea is to visualize a sequence of events 
using exactly the same sensory computational 
pathway as would be stimulated during the actual 
performance of the sequence. The differences are 
that: 1) the actual raw sensor transducers are not 
stimulated, 2) conceptually, the behaviors are dis- 
placed along the time dimension of the sensor 
space, and 3) the reactors do not generate motor 
commands. Tlie same mechanisms used for reac- 
tive behavior (the trigger mechanisms and the arbi- 
tration of actions) are used for planning. 

Obviously, the prediction is defined only in the 
subspace in which the reactor may potentidly have 
a repeatable, predictable effect — a reactor which 


closes the robot’s grippo* will, in genoal, have no 
predictable, correlatable effect on sensed ambient 
light intensity. 

As in schema theory, a subsumption planner pre- 
dictor is a black box. The internal mechanism of 
the predictor is not important to the functioning of 
the entire distributed system; only the externally 
observable functional behavior (i.e., its role in the 
architecture) must be well defln^. The implemen- 
tation may use a procedural algorithm, a neural 
networic, specialized hardware, or any other tech- 
nique to perform its function. 

4.4 Hierarc hical Organization 

While planning, no ouq>uts should be generated 
fipom the activity of the behavior modules. The re- 
actor does not generate output if the trigger is in 
the virtual srasor space. However, since the sub- 
sumption architecture’s arbitration network (the 
coUectitm of output wires along with their connec- 
tors) are connected to reactors which genoate out- 
puts whenever they are triggered, it is difficult to 
determine the actual controlling behavior module 
without actually generating motor ouq>uts to send 
through the netwtvk. This would require some s(xt 
of action gating mechanism at the output of the ar- 
bitration network to inhibit the passage of motor 
commands. The result of the arbitration needs to 
find its way back to the predictor also. 

The cotmectivity of the subsumption network de- 
fines its global behavior. Behavioral modules 
which are triggered upon sensing special oiviron- 
mental situations subsume lower level behaviors 
(if the subsuming behavior was not triggered by a 
special case of the subsumed behavior, the sub- 
sumed behavior would not be triggered in the first 
place, and the subsumption relationship would be 
inappropriate, i.e., in a propaly designed network 
the subsumed behavior should already be triggered 
and operating when the subsuming behavior takes 
over). In a sense, the managa* behavior comman- 
deers control of the entire system because it be- 
lieves it is best suited (in the eyes of the designer) 
to deal with the problem at hand. 

The triggoing of a module does not automatically 
cause the robot to perform the actions genoated by 
this module. Several behavior modules may be 
triggered by the same sensory state. The actions 
must be mediated by the subsumption or arbitra- 
tion network. This network detennines which be- 
havior may assume control of the entire system at 
any one time. In genoal, behaviors which are trig- 
gered by a smallra*, more exclusive region within 
die sensory space will be regarded as more appro- 
priate than more generic behaviors. The result is 


that behaviors which trigger upon special situa- 
tions, or exceptions to the general rule, will have 
priority over other bdiaviors. These more sophisti- 
cated, specialized manager behaviors “subsume” 
lower-level, or simpler behaviors. 

In order to retain “ownCTship” of the motor com- 
mands by the generating behavior, an alternative is 
to perform the arbiuation before the commands are 
output by the reactors. By doing this, the behavior 
module knows that it has been selected (on the ba- 
sis of either real or virtual senses) as the controll^ 
of the entire system, and the predictor can act ac- 
cordingly. If the trigger source is the virtual sensor 
space, the arbitration still occurs, and the result is 
known locally to the behavior module. 

The subsumption architecture triggers all eligible 
(evCT inappropriate) behavior modules to generate 
actions in pandlel. By giving each module some 
awareness of its own place in the control hierar- 
chy, and some visibility into the activity of behav- 
iors which may override its own eligibility, the 
numbo* of eligible, active modules may be reduced 
to those which may actually control the robot. 
(Note that several subsumption networks may con- 
trol a robot, so that several different behavior 
modules may be generating actions at the same 
time for different subsystems). This type of hierar- 
chical selection lends itself to the use of schema 
theory and object-oriented software techniques. 

The arbitration network composed of suppression, 
inhibition, and default nodes in the subsumption 
architecture functions similarly to the schema as- 
semblage construct in schema theory. A schema 
assemblage is a collection of (possibly intercon- 
nected) schemas and is itself regarded as yet an- 
other schema. The component schemas are defini- 
tions rather than instantiations, so that multiple in- 
clusions in different schemas are realizable. This 
self-similar hioarchical stmcture lends itself to ob- 
ject-oriented software construction techniques. 
Object classes are built from less specific classes 
as specializations of those classes in the same way 
that subsuming behaviors are triggered by special 
cases of the triggering stimuli of subsumed behav- 
iors. 

The partial world model relevant to a subset of be- 
havioral modules is contained in the module which 
manages that subset. This is the highest-level 
module contained in that subset. For example, a 
behavior called go-to(elevator) which goes to the 
elevator in a building given the current locatitxi 
must direct lower level behaviors to orchestrate a 
sequence of behaviors (i.e., make managerial deci- 
sions) which are triggered at intersections, such as 
turn(left), turn(right), and turn(straight). The 


go-to behavior contains the necess^ world 
knowledge to get to a known landmaik, in this case 
the elevator. The managed behavior (turn) con- 
tains no world knowledge of that nature. Its world 
knowledge is only that which it needs to know to 
perform a turn successfully. 

5. Maze Navigation 

Navigation through a maze provides a clear and 
simple example of the generation and execution of 
plans. In this case, the successful path through the 
maze is directly representable as a path through 
state space, the relevant state in this case being lo- 
cation. A navigation plan is a predetermined se- 
quence describing the choices made at each deci- 
sion point, i.e., at forks in the road or intersections. 
This can te represented as a travosal of a decision 
tree 0P»g- 2). 


start 



Upon encountering a decision point, such as an 
intosection, the subsumption architecture relies on 
a prepr ogrammed, rote strategy to select one of the 
alternatives. This arbitrary tie-breaker rule may be 
“always head south”, or “follow the left wdl”. 
These can be successhil strategies for maze navi- 
gation, but they are rarely optimum. 

The best plans are based on previous experience. 
Searches for optimal paths through state space 
have been studied extensively in artificial intelli- 
gence. These require a cost criterion as well as a 
guide to choices available, i.e., a map. A map sim- 
ply rq>resents the accumulated expaience of the 
mapmaker. Heuristics may be used (i.e., stay on 
the main path, or go straight, until you have a good 


reason not to) in the absence of speciflc mjq) 
knowledge. 

This search for an optimum path creates a plan 
based on the expectations of the planner. If the 
domain has changed, or the map is incorrect, the 
planner must generate a new plan. The new plan, 
in order to be optimal, may require some l^k- 
tracking, or it may start from the current state^. 
The subsumption planner replans, without explicit 
backuacking, from the moment it senses a dis^p- 
ancy between expectation and reality. The plan 
gen^ted may include as its first segment a back- 
track; howevCT, from the point of view of the new 
plan, it is not backtracking, but rather generating a 
new complete plan from the current state. 

The current location and orientation of the robot is 
represented as a point in (abstract) state space. 
Subspaces representing the sensors and their ab- 
stractions which are used to determine location, 
orientation, obstacles, and other prop^es impOT- 
tant to navigation and travel, will contain the trig- 
gers for these behaviors. A behavior module is 
triggered when the current state lies within some 
specific trigger region. For example, the region of 
sensor state q>ace which rq>resents the situation in 
which a large object is directly in front of the robot 
should trigger an obstacle avoidance behavior 
module which causes a turn. 

5.1 Path Diannina 

A goal is the desired end state of a behavior. This 
definition of a goal assumes that the robot has 
achieved the purpose for its existence when this 
end state has been reached. This is only the case 
for simple robots and well-defined behaviors. In 
general, each subtask also has a goal, and each 
sub-subtask has goals. Similarly, the goal for the 
entire behavior is really a subgoal of a larger con- 
textual plan, which may only be implicit in the de- 
sign of a robot. For example, a robot’s explicit 
goal at the topmost level may be to achieve a 
mowed lawn. This is a subgoal of the implicit task 
“keep the lawn mowed” which requires that the 
robot rqpeat the lawn mowing subtask whenever 
the grass gets too long. Real plans thus have a hi- 
erarchical, self-similar nature where simpler sub- 
tasks look very similar to the contextual task. Thus 
we can treat contextual plans in the same way as 
subtasks. 

The decision tree which represents all possible 
paths (starting from the current state) to a goal is 
the problem space within which the navigation 
plan must be constructed. This problem space is a 
virtual scratchpad for the subsumption planner. 
The root of this tree rqjresents the system’s current 


location. The planner visualizes a future course of 
action which is expected to reach a goal. 

The planner makes its decisions based on general 
knowledge of the problem domain, which it has 
accumulated by expmence or by design. The ben- 
efit of using a planner over simple reactive behav- 
ior is that knowledge which is more global in na- 
ture than that available at the decision point may 
be applied to solve the problem in (hopefully) a 
more efficient manner. This more-global knowl- 
edge is in fact a partial world model. The premise 
of the subsumption planner is that partial world 
models may be distributed across the system as 
cause-effect predictors, associated with the action- 
generating behaviors which are capable of steering 
the robot into a desired state. 

It is the responsibility of the cause-effect predic- 
tors to propagate the decision tree into the future to 
determine the best plan. Unlike a game theoretic 
min-max planner, in which the goal is to win the 
game, not to reach any specific game state, the 
subsumption planner can create a much more spe- 
cific visualization of the goal. A chess playing al- 
gorithm can only “visualize” the winning goal 
state by forward chaining from the current state; 
there are a great number of possible winning 
states. It does not select a goal state and attempt to 
genmte a plan which achieves it The chess play^ 
tries a great number of state trajectories until a 
completely defined state is achieved which belongs 
to the subset of checkmate (goal) states. To the 
general purpose planner, however, most specifics 
of the goal state are irrelevant and can safely be 
ignored. 

In the case of the maze navigator, the only aspect 
of the goal state which is relevant is the location. 
All other aspects of the goal state can be ignored. 
Since the goal state may be so loosely defined, the 
planner can restrict its search for potential behav- 
iors to only those whose effect is to cause the robot 
to change location. 

The most genial behavior which can cause this ef- 
fect may be called the travel behavior. This be- 
havior may direct, manage, or simply allow (but 
not micro-manage!) the action of subs^ient be- 
haviors which cause forward motion, turns, obsta- 
cle avoidance, etc. 

The entire course of action of the travel behavior 
need not be plaimed in great detail in advance. The 
robot need only realize that travel is capable of 
poducing the desired effect. Once this is known, 
the robot depends on travel to complete the entire 
task of getting to the goal. The rq)eitoire of behav- 


iors travel has at its beck and call will perform 
their own planning as needed. 

The travel behavior makes its decisions in the 
context of its view of the sensor space. It delegates 
responsibility for selecting a sequence of turns at 
intersections to the navigate-maze behavior when 
it senses the domain (walls and passageways) 
through which it must travel. (An alternate behav- 
ior might be navigate-meadow, a meadow is a 
term for a large area where there is a possibility 
that proximity sensor bearings may lost, so 
other navigation techniques, such as dead-reckon- 
ing, are called for.) 

The navigate-maze behavior selects an active 
subordinate behavior based on its “map” (a repre- 
sentation of its expectations) or on heuristics if the 
map does not t^ply to the current situation. It does 
this by examining relevant elements of the real or 
virtual state space (orientation and location) and 
associating with that trigger state a “best” reaction. 
The “map” is thoefore in the form of an associator 
which generates some reaction based on a set of 
inputs. This associator represents a partial world 
model. Each managoial behavior module contains 
an associator which embodies only the knowledge 
relevant to that behavior. 

The plan, rqnesented as a trajectory in state space, 
has associated with it a quality indicator. When the 
state arrives at a decision point in real execution, 
the decision it makes is the one associated with the 
highest quality indicator. If sensors indicate a de- 
viation from the expected sensor state as contained 
in the virtual sensor space, the quality of trajecto- 
ries which are affected by that deviation is re- 
duced. Alternate plans may or may not be gener- 
ated at that point. At the point of departure from 
expectations (i.e., the current state), the robot 
m^es a decision according to the new highest 
quality plan. 

5.2 An Example 

Assume that a maze is to be navigated as in Fig. 3. 
The robot will start at location s and is to find its 
way to goal location g. There are two paths. The 
shortest path, right at a and left at b, is blocked by 
an obstacle which cannot be sensed until past b. 



The subsumption planner contains a travel behav- 
ior capable of physically relocating the robot. This 
manage behavior controls subordinate behaviors 
navigate-maze and navigate-meadow. In the sub- 
sumption architecture, it would do this by inhibit- 
ing output from the undesired behavior. Another 
behavior, follow-corridor, is a behavior triggered 
upon sensing walls on either side and clear space 
in front It simply travels along the midline of the 
corridor. The turn bdiavior is a composite behav- 
ior, consisting of a turn-manager and four types 
of subordinate turn generating behaviors, 
turn(left), turn(right), turn(straight), and 
tum(back). These turns are dq)endent on orienta- 
tion; turns dependent on compass direction could 
be used identically. The turn-manager contains a 
tum-associator which is the mechanism containing 
the world model for the navigation domain. Since 
navigation in the maze world is performed by de- 
ciding which way to turn at intersections, the tum- 
nuq> in this case resides in turn-manager. Another 
partial model, the corridor-length-map, provides 
expectations of distance to the foUow-corridor 
predictor. Other low level modules, not discussed 
here, prevent collisions with obstacles, keep the 
robot in the center of the hall, keep the robot 
moving, etc. 

The robot will behave as follows: 

1) The robot begins at location s. The fol- 
low-corridor reactor begins to generate actions as 
in the traditional subsumption architecture, and the 
robot starts moving down the hall. It moves toward 
location a. 

2) At the same time that the reactor begins 
generating actions, the travel behavior is triggered 
by some higher level behavior to start the planning 
process. The travel behavior assumes responsibil- 
ity for moving the robot from sto g. The predictor 
generates a simple state space trajectory between s 
and g. Travel allows navigate-maze to assume re- 


^nsibility for completing the plan and its execu- 
tion. Only navigate-maze is triggered by the ab- 
stract maze/meadow sensor. The follow-corridor 
sensor is also triggered by the abstract corridor 
srasor. The follow-corridor predictor generates 
the expected location a based on its knowledge of 
corridor length. In the virtual sensor map, a is 
stored as a waypoint to be reached at time /q The 
virtual sensor space at a indicates the sensory situ- 
ation of a 4-way intersection. The turn-manager 
behavior is virtually triggered by an abstract inter- 
section sensor. 

3) The turn-manager associator senses 
global location a. The global location sensor is ab- 
stracted from a variety of fused raw sensors, po'- 
haps including visual clues, wheel odometers, ^- 
tial sensors such as sonar or rangefinder, and any 
other sensors which distinguish this location as 
different from others in any way. The associator 
outputs “turn-right” as the highest quality plan, fol- 
lowed by “turn-straight” as the next highest Note 
these are not behaviors but rather decisions on 
which behavior should be activated as part of the 
plan. Other alternatives have a quality indication of 

ZCTO. 

4) The planner is now done until the robot 
reaches a, since all details for the plan segment 
from s to a are complete. The turn manager in- 
hibits its own triggering until ready to resume 
planning. 

5) The robot reaches a. Since the prediction 
turned out to be accurate, the top quality plan has 
not changed. This plan indicates the behavior 
tum(right) should be executed next. Turn right is 
selected by turn-manager as active, and the pre- 
dictor generates a’ as the next waypoint, a' virtu- 
ally triggers follow-corridor, and that predictor 
generates waypoint b. Virtual sensor situation b 
then triggers turn-manager, which inhibits itself 
from making a commitment until b is physically 
reached. The planner waits for the robot to catch 
up. 

6) At b, the turn-manager associates gen- 
erates the decision “turn-left”. Robot starts the 
turn(left) behavior using the reactor mechanism. 
But, an unexpected obstacle is sensed. The blocked 
path causes the entire trajectory quality to go to 
zero. The expectation violation is made avail- 
able to any behavior. 

7) The robot is still near b. The navigate- 
maze behavior notices the expectation violation. 
Its reaction is to replan. It presets that it can still 
reach the goal, since there is a second, albeit Iowa* 
quality, plan available from a previous waypoint. 


All motion-generating behaviors are allowed to 
trigger from the real sensor state b. Follow-corri- 
dor has the best quality plan, a direct trajectory to 
end up near a. The ro^t now moves under direc- 
tion of follow-corridor. 

8) At a ’ , c, and d, decisions are made in the 

same manner as above. Finally, the robot moves to 
g. Travel has completed its visualized plan, along 
with all its subordinates. The travel behavior ter- 
minates, and current state g may be used to trigger 
the next behavior. 

6. Conclusion 

This paper has described an extension to the sub- 
sumption architecture which implements planning 
using a virtual sensor space as a planning tool. The 
planner, given a goal, gen^tes a new plan, moni- 
tors execution of the plan, and replans in the event 
of a discrepancy between expectation and reality. 
The planner operates as a distributed network in 
parallel with the existing subsumption network. 
Hie theoretical motivation for this technique was 
presented. An example in the domain of maze nav- 
igation showed its operation in a simple applica- 
tion. 
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Abstract 

This paper addresses a central issue in robot 
construction, namely the control of deliberation 
time. The complexity of automated planning and 
scheduling makes it undesirable, sometimes in- 
feasible, to find the optimal action in every situa- 
tion since the deliberation process itself degrades 
the performance of the system. The question is 
how can an intelligent robot react to a situation 
after performing the “right” amount of thinking. 

It is by now widely accepted that a successful 
robotic system must trade off between decision 
quality and the computational resources used to 
produce it. Anytime algorithms, introduced by 
Dean, Horvitz and others in the late 1980’s, were 
designed to offer such a trade-off. Recent work by 
Zilberstein and Russell shows that the advantages 
of anytime algorithms can be extended to the con- 
struction of complex robotic systems. This paper 
describes the compilation and monitoring mech- 
anisms that are required to build robots that can 
efficiently control their deliberation time. 

I. Control of Deliberation Time 

Intelligent robots must perform real-time deliberation 
to solve such problems as path planning, task scheduling, 
and interpretation of sensory data. An important aspect 
of intelligent behavior is the capability of robots to factor 
the cost of deliberation into the deliberation process. Two 
factors determine the cost of deliberation: the resources 
consumed by the process, primarily computation time, and 
constant change in the environment that may decrease the 
relevance of the outcome and hence reduce its value. A 
useful mechanism to quantify this dependency is based on 
the definition of a utility function U (5) over the states of 
the world. The utility of a state defines the desirability of 
that state. For example, the utility of a robot that assem- 
bles a certain product can be measured by the number of 
products completed each hour. Utility functions extend the 


traditional notion of deadline (allowing for gradual decrease 
of value over time) and the traditional notion of goals (al- 
lowing for partial goal satisfaction). Thus, they are more 
suitable for control of real-time robotic systems. 

To choose an optimal course of action and to maximize 
its utility function, a robot must perform some real-time 
problem solving. The performance of robotic systems can 
be improved by optimizing the quality of their decisions 
net of deliberation cost. The problem of deliberation cost 
has been widely discussed in economics, engineering and 
artificial intelligence. In artificial intelligence, researchers 
have proposed a number of meta-level architectures to con- 
trol the cost of base-level reasoning [3, 9, 14]. The model 
presented in this paper belongs to this class of solutions: 
its meta-level reasoning component optimizes resource al- 
location to the base-level performance components. This 
approach separates two, central aspects of robot construc- 
tion: the development of the performance components and 
the optimization of performance. This modularity is ac- 
complish by using anytime algorithms as the elementary 
components of the system. 

The rest of the paper describes our approach in de- 
tail. Section II describes the notion of anytime algorithms. 
It shows how to construct anytime algorithms and how to 
characterize the trade-off that they offer between quality of 
results and computation time. Section III explains the ben- 
efits and difficulties involved in the composition of anytime 
algorithms. Sections IV and V describe the two main com- 
ponents of our solution to the composition problem, namely 
off-line compilation and run-time monitoring. Section VI 
describes briefly some applications of this approach. Fi- 
nally, Section VII summarizes the benefits of our approach 
and discusses some directions for further work. 

II. Anytime Algorithms 

The term “anytime algorithm” was coined by Dean 
in the late 1980’s in the context of his work on time- 
dependent planning. Anytime algorithms are algorithms 
whose quality of results improves gradually as computa- 
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tion time increases, hence they offer a tradeoff between 
resource consumption and output quality. Many numerical 
approximation methods, such as Taylor series approxima- 
tion, are based on iterative improvement and, as such, can 
be considered an anytime algorithm. 

Various metrics can be used to measure the quality 
of a result produced by an anytime algorithm. From a 
pragmatic point of view, it may seem useful to define a 
single type of quality measure to be applied to all anytime 
algorithms. Such a unifying approach may simplify the 
meta-level control. However, in practice, different types 
of anytime algorithms tend to approach the exact result in 
completely different ways. The following metrics have 
been proved useful in anytime algorithm construction; 

1 . Certainty - this metric reflects the degree of certainty 
that the result is correct. The degree of certainty can be 
expressed using probabilities, fuzzy set membership, 
or any other approach. 

2. Accuracy - this metric reflects the degree of accuracy 
or how close is the approximate result to the exact 
answer. Normally with such algorithms, high quality 
provides a guarantee that the error is below a certain 
small upper bound. 

3. Specificity ~ this metric reflects the level of detail of 
the result. In this case, the anytime algorithm always 
produces correct results, but the level of detail is in- 
creased over time. 

Many existing programming techniques produce use- 
ful anytime algorithms. Examples include iterative deepen- 
ing search, variable precision logic, and randomized tech- 
niques such as Monte Carlo algorithms or fingerprinting 
algorithms. For a survey of such programming techniques 
and examples of algorithms see [21]. 

The notion of interrupted computation is almost as old 
as computation itself. However, traditionally, interruption 
was used primarily for two purposes: aborting the execution 
of an algorithm whose results are no longer necessary, or 
suspending the execution of an algorithm for a short time be- 
cause a computation of higher priority must be performed. 
Anytime algorithms offer a third type of interruption: inter- 
ruption of the execution of an algorithm whose results are 
considered “good enough” by their consumer. 

Conditional performance profiles 

To allow for efficient meta-level control of anytime 
algorithms, we characterize their behavior by conditional 
performance profiles (CPP) [19]. A conditional perfor- 
mance profile captures the dependency of output quality 
on time allocation as well as on input quality. In [21], the 
reader can find a detailed discussion of various types of con- 
ditional performance profiles and their representation. To 
simplify the discussion of compilation, we will refer only 



Figure 1 : Graphical representation of a CPP 


to the expected CPP that maps computation time and input 
quality to the expect output quality. 

Definition 1 The conditional performance profile (CPP), 
of an algorithm A is a function CP Pa : Qin x ^ 
Qout that maps input quality and computation time to the 
expected quality of the results. 

Figure 1 shows a typical CPP. Each curve represents 
the expected output quality as a function of time for a given 
input quality. 

Interruptible and contract algorithms 

In [15] we make an important distinction between two 
types of anytime algorithms, namely interruptible and con- 
tract algorithms. An interruptible algorithm can be inter- 
rupted at any time to produce results whose quality is de- 
scribed by its performance profile. A contract algorithm 
offers a similar trade-off between computation time and 
quality of results, but it must know the total allocation of 
time in advance. If interrupted at any point before the ter- 
mination of the contract time, it may yield no useful results. 
Interruptible algorithms are in many cases more appropri- 
ate for the application, but they are also more complicated 
to construct. In [15] we show that a simple, general con- 
struction can produce an interruptible version for any given 
contract algorithm, with only a small, constant penalty. 
This theorem allows us to concentrate on the construction 
of contract algorithms for complex decision-making tasks 
and then convert them into interruptible algorithms using a 
standard transformation. 

111 . Composing Anytime Algorithms 


Modularity is widely recognized as an important issue 
in system design and implementation. However, the use of 
anytime algorithms as the components of a modular system 
presents a special type of scheduling problem. The question 
is how much time to allocate to each component in order 
to maximize the output quality of the complete system. We 
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Figure 2: A composite module for speech recognition 


refer to this problem as the anytime algorithm composition 
problem. 

Consider for example a speech recognition system 
whose structure is shown in Figure 2. Each box represents 
an elementary anytime algorithm whose conditional perfor- 
mance profile is given. The system is composed of three 
main components. First, the speaker is classified in terms of 
gender and accent. Then a recognition algorithm suggests 
several possible matching utterances. And finally, the lin- 
guistic validity of each possible utterance is determined and 
the best interpretation is selected. The composition prob- 
lem is the problem of calculating how much time to allocate 
to each elementary component of the composite system, so 
as to maximize the quality of the utterance recognition. 

Solving the composition problem is important for sev- 
eral reasons. First, it introduces a new kind of modu- 
larity into real-time system development by allowing for 
separation between the development of the performance 
components and the optimization of their performance. In 
traditional design of real-time systems, the performance 
components must meet certain time constraints that are not 
always known at design time. The result is a hand-tuning 
process that, may or may not, culminate with a working 
system. Anytime computation offers an alternative to this 
approach. By developing performance components that are 
responsive to a wide range of time allocations, one avoids 
the commitment to a particular performance level that might 
fail the system. 

The second reason why the composition problem is 
important relates to the difficulty of programming with any- 
time algorithms. To make a composite system optimal (or 
even executable), one must control the activation and in- 
terruption of the components. In solving the composition 
problem, our goal is to minimize the responsibility of the 
programmer regarding this optimization problem. Our so- 
lution is described in the following two sections. 

IV. Compilation 

Given a system composed of anytime algorithms, the 
compilation process is designed to: (a) determine the opti- 
mal performance profile of the complete system; and (b) 
insert into the composite module the necessary code to 


achieve that performance. The precise definition and solu- 
tion of the problem depend on the following factors: 

1 . Composite program structure - what type of pro- 
gramming operators are used to compose anytime al- 
gorithms? 

2. Type of performance profiles - what kind of per- 
formance profiles are used to characterize elementary 
anytime algorithms? 

3. Type of anytime algorithms - what type of elemen- 
tary anytime algorithms are used as input? what type 
of anytime algorithm should the resulting system be? 

4. Type of monitoring - what type of run-time monitor- 
ing is used to activate and interrupt the execution of 
the elementary components? 

5. Quality of intermediate results - what access does 
the monitoring component have to intermediate re- 
sults? is the actual quality of an intermediate result 
known to the monitor? 

Depending on these factors, different types of compi- 
lation and monitoring strategies are needed. To simplify the 
discussion in this paper, we will consider only the problem 
of producing contract algorithms when the conditional per- 
formance profiles of the components are given. We will as- 
sume that no active monitoring is allowed once the system 
is activated. A broader, in-depth analysis of compilation 
and monitoring can be found in [21]. 

Let .F be a set of anytime functions. Assume that all 
function parameters are passed by value and that functions 
have no side-effects (as in pure functional programming). 
Let X be a set of input variables. Then, the notion of a 
composite expression is defined as follows: 

Definition 2 A composite expression over T with input X 
is: 

L An expression /(zi, Zn) where f E T is a function 
ofn arguments and zi , Zn ^ X, 

2. An expression f{gi , ^n) where f ^ T is a function 
ofn arguments and each Qi is a composite expression 
or an input variable. 

For example, the expression A[B{x).,C{D{y))) is a 
composite expression over {A, C, D} with input {x, y}. 
Suppose that each function in !F has a conditional perfor- 
mance profile associated with it that specifies the quality of 
its output as a function of time allocation to that function 
and the qualities of its inputs. Given a composite expres- 
sion of size n, the main part of the compilation process is 
to determine a mapping: 

r ^ (<i,...,tn) (1) 

This mapping determines for each total allocation, t, the 
allocation to the components that maximizes the output 
quality. 
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Figure 3: The performance profiles of A\ and A 2 

A compilation example 

Let us look first at a simple example of compilation 
involving only two anytime algorithms. Suppose that one 
algorithm takes the input and produces an intermediate re- 
sult. This result is then used as input to another anytime 
algorithm which, in turn, produces the final result. Many 
systems can be implemented by a composition of a sequence 
of two or more algorithms. For example, an automated re- 
pair system can be composed of two algorithms: diagnosis 
and treatment. This can be represented in general by the 
following expression: 

Output A 2 {Ai{Input)) 

Figure 3 shows the performance profiles of A\ and 
A 2 . These performance profiles are defined by: 

Qi{t) = l-e-^^^ Q^{t) = I - 

Assume that the output quality is the sum of the qualities of 
Ai and A 2 , then the following result holds: 

Theorem 3 Given the performance profiles ofA\ and A 2 , 
the optimal time allocation mapping is: 

^ / In Aj — In A2 H- A2^ In A2 — In Ai + Ai^ ^ 

Ai + A2 ’ Ai + A2 ^ 

Proof: Since the overall output quality is: 

Q(x) = 1 - + 1 - ( 3 ) 

the maximal quality is achieved when ^ = 0. 

In other words: 

Aie"^>* - A2e-^"(‘-^) =0 (4) 

The solution of this equation yields the above allocation. □ 

To complete the compilation process, the compiler 
needs to insert code in the original expression for proper 
activation of A\ and A 2 as contract algorithms with the 
appropriate time allocation. This is done by replacing the 
simple function call by an anytime function call [21]. The 
implementation of an anytime function call depends on the 
particular programming environment and will not be dis- 
cussed in this paper. 

The complexity of compilation 

The compilation problem is defined as an optimization 
problem, that is, a problem of finding a schedule of a set 


of components that yields maximal output quality. In order 
to analyze its complexity, it is more convenient to refer to 
the decision problem variant of the compilation problem. 
Given a composite expression e, the conditional perfor- 
mance profiles of its components, and a total allocation 
By the decision problem is whether there exists a schedule 
of the components that yields output quality greater than 
or equal to K. To begin, consider the general problem 
of global compilation of composite expressions, or GCCE. 
In [21], we prove the following result: 

Theorem 4 The GCCE problem is NP-complete in the 
strong sense. 

The proof is based on a reduction from the PAR- 
TIALLY ORDERED KNAPSACK problem which is 
known to be NP-complete in the strong sense. The mean- 
ing of this result is that the application of the compilation 
technique may be limited to small programs. To address the 
complexity problem of global compilation, we developed 
an efficient local compilation technique. 

Local compilation 

Local compilation is the process of finding the best 
performance profile of a module based on the performance 
profiles of its immediate components. If those components 
are not elementary anytime algorithms, then their perfor- 
mance profiles are determined using local compilation. Lo- 
cal compilation replaces the global optimization problem 
with a set of simpler, local optimization problems and re- 
duce the complexity of the whole problem. Unfortunately, 
local compilation cannot be applied to every composite ex- 
pression. If the expression has repeated subexpressions, 
then computation time should be allocated only once to 
evaluate all identical copies. Local compilation cannot han- 
dle such cases. However, the following three assumptions 
make local compilation both efficient and optimal [21]: 

1 . The tree-structured assumption - the input compos- 
ite expression has no repeated subexpressions, thus its 
DAG (directed acyclic graph) representation is a tree. 

2. The input-monotonicity assumption - the output 
quality of each module increases when the quality of 
the input improves. 

3. The bounded-degree assumption - the number of 
inputs to each module is bounded by a constant, h. 

Under these assumptions, local compilation is both ef- 
ficient and yields optimal results [21]. The first assumption 
is needed so that local compilation can be applied. The 
second assumption is needed to guarantee the optimality of 
the resulting performance profile. And the third assumption 
is needed to guarantee the efficiency of local compilation. 
Using an efficient tabular representation of performance 
profiles, we could perform local compilation in constant 
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Figure 4: DAG representation of F 


time and reduce the overall complexity of compilation to 
be linear in the size of the program. 

Repeated subexpressions 

While the input-monotonicity and the bounded-degree 
assumptions are quite reasonable (and also desirable from a 
methodological point of view), the tree-structured assump- 
tion is somewhat restrictive. We want to be able to handle 
the case of repeated subexpressions. To understand the 
problem, consider the following expression: 

F = E{D{B{A{x)),CiA{x)))) 

Figure 4 shows the DAG representation of F. Recall that 
the purpose of compilation is to compute a time allocation 
mapping that would specify for each input quality and total 
allocation of time the best apportionment of time to the 
components so as to maximize the expected quality of the 
output. But local compilation is only possible when one 
can repeatedly break a program into sub-programs whose 
execution intervals are disjoint, so that allocating a certain 
amount of time to one sub-program does not affect in any 
way the evaluation and quality of the other sub-programs. 
This property does not hold for DAGs. In the example 
shown in Figure 4, B and C are the ancestors of D, but 
their time allocations cannot be considered independently 
since they both use the same sub-expression, A{x). 

To address this problem we have developed a num- 
ber of approximate compilation techniques that work effi- 
ciently on DAGs, but do not guarantee optimality of the 
schedule [21]. The compilation of additional programming 
constructs, such as conditional statements and loops, is an- 
alyzed in [21]. To summarize, a number of compilation 
techniques have been developed that can efficiently pro- 
duce the performance profile of a composite system based 
on the performance profiles of its components. 


V> Run-Time Monitoring 

Monitoring plays a central role in anytime computa- 
tion as it complements anytime algorithms with a mech- 
anism that determines their run-time. We examine the 
monitoring problem in two types of domains. One type 
is characterized by the predictability of utility change over 
time. High predictability of utility allows an efficient use 
of contract algorithms modified by various strategies for 
contract adjustment. The second type of domains is char- 
acterized by rapid change and a high level of uncertainty. In 
such domains, active monitoring, that schedules interrupt- 
ible algorithms based on the value of computation criterion, 
becomes essential. 

Given a compound anytime program, V, whose el- 
ementary anytime components arc E = {^i, a 

monitoring scheme is defined as a mapping that determines 
a certain time allocation for each activation of an elementary 
component. 

Definition 5 A monitoring scheme for a program V is a 
mapping: 

M :ExZ^ 

where E is the set of elementary components ofV. 

is the time allocation to the activation of 
the component. A monitoring scheme supplies the nec- 
essary information to make a compound anytime program 
executable in a well defined way. In defining monitoring 
schemes, we make a distinction between passive and active 
monitoring. 

Definition 6 A monitoring scheme is said to be passive if 
the corresponding time allocation mapping is completely 
determined prior to the activation of the system. 

Definition 7 A monitoring scheme is said to be active if it 
is not passive. That is, the corresponding time allocation 
mapping is partially determined while the system is active. 

Under active monitoring, some scheduling decisions 
are made at run-time. Such decisions are based on the 
actual quality of results produced by the anytime compo- 
nents and based on the actual change that occurred in the 
environment. The main reason why active monitoring is 
necessary in control of anytime algorithms is the problem 
of uncertainty. In an entirely deterministic world, passive 
monitoring can yield optimal performance. However, in 
unpredictable domains there is much to be gained in per- 
formance by introducing an active monitoring component. 

Two primary sources of uncertainty affect the opera- 
tion of real-time robotic systems. The first source is internal 
to the system. It is caused by the unpredictable behavior 
of the system itself. The second source is external. It is 
caused by unpredictable changes in the environment. These 
two sources of uncertainty are characterized by two separate 
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knowledge sources. Uncertainty regarding the performance 
of the system is characterized by the performance profile of 
the system (in particular, we use performance distribution 
profiles to represent the probability distribution of quality 
of results). Uncertainty regarding the future state of the 
environment is characterized by the model of the environ- 
ment. Obviously, the type of active monitoring may vary 
as a function of the source of uncertainty and the degree of 
uncertainty. 

Monitoring contract algorithms 

It is easier to construct contract algorithms than in- 
terruptible ones, both as elementary and as compound al- 
gorithms. Therefore, I will examine first the monitoring 
problem assuming that the complete system is presented 
as a contract algorithm, A. The conditional performance 
profile of the system is t) where q is the input quality 
and t is the time allocation. Assume that Qa{< 1^ t) repre- 
sents, in the general case, a probability distribution. When 
a discrete representation is used, Q^(g, f)[gi] denotes the 
probability of output quality Qi. 

Let So be the current state of the domain and let St 
represent the state of the domain at time f, let qt represent 
the quality of the result of the contract anytime algorithm at 
time t, UA{S,t,q) represents the utility of a result of quality 
q in state S at time i. This utility function is given as part 
of the problem description. The purpose of the monitor is 
to maximize the expected utility of the result, that is, to find 
t for which is maximal. Contract algorithms 

are especially useful in a particular type of domains which 
is defined as follows: 

Definition 8 A domain is said to have predictable utility if 
q) can be determined for any future time, t, and 
quality of results, q, once the current state of the domain, 
So, is known. 

The notion of predictable utility is a property of do- 
mains. The same utility function can be predictable in one 
domain and unpredictable in another. What makes a domain 
predictable is the capability to determine the exact value of 
results of a particular quality at any future time. Hence, the 
state of the domain may change, even in an unpredictable 
way, and utility may still be predictable. To explain this sit- 
uation, we define a function, /(5), that isolates the features 
of a state that determine its utility. In other words, 

V5i,52 f{Sr)=f{S2) UA{Sufq) = UA{S2,fq) 

(5) 

Consider for example a transportation domain that refers 
to traffic on a particular road. The state of the domain is 
defined by the location and velocity of each vehicle and 
/ (5) may be, for example, the traffic density. Using the 
function /, it is easy to show that a domain with predictable 
utility is a domain for which f{St) can be determined once 
the current state, So, is known. In general, three typical 


cases of such domains can be identified: 

1 . A static domain is obviously predictable since St = So 
and f{St) = /(5o). For example, the game of chess 
constitutes a static domain. 

2. A domain that has a deterministic model is predictable 
since future states can be uniquely determined and 
hence f{St) can be determined. For example, a do- 
main that includes moving objects has a deterministic 
model when the velocity of each object is constant. 

3. A domain for which there is a deterministic model to 
compute /(St), once the current state is known, is pre- 
dictable. Note that this does not require a deterministic 
model of the domain itself. An important sub-class is 
all the domains for which /(S) = 0, that is, domains 
in which the utility function depends only on time. 

The initial contract time 

The first step in monitoring contract algorithms in- 
volves the calculation of the initial contract time. Due 
to uncertainty concerning the quality of the result of the 
algorithm, the expected utility of the result at time t is 
represented by: 

^A(St,t) = Y^QA{q,t)[qi]UA{St,t^Qi) (6) 

I 

The probability distribution of future output quality is pro- 
vided by the performance profile of the algorithm. Hence, 
an initial contract time, tc, can be determined before the 
system is activated by solving the following equation: 

tc = argm^{U'j^{St,t)} ( 7 ) 

Under passive monitoring, this initial contract time is used 
to determine (using the compiled performance profile of the 
system) the ultimate allocation to each component. 

In some cases, it is possible to separate the value of 
the results from the time used to generate them. In such 
cases, one can express the comprehensive utility function, 
Ua{S^ ty q) as the difference between two functions: 

UA{Styt,q) -VA{So,q) - Cost{SoA) ( 8 ) 

where Va (S', ^) is the value of a result of quality ^ in a par- 
ticular state S (termed intrinsic utility [14]) and Cost{S, t) 
is the cost of t time units provided that the current state 
is S. Similar to the expected utility, the expected intrinsic 
utility for any allocation of time can be calculated using the 
performance profile of the algorithm: 

VA(S,t) = ^QA{q,t)[qi]VMS,qi) (9) 

i 

Finally, the initial contract time can be determined by solv- 
ing the following equation: 

tc = argm^{V^(So,t) - Cost{So,t)} (10) 
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Figure 5: A sequence of residual sub-systems 


Once an initial contract time is determined, several 
monitoring policies can be applied. The most trivial one is 
the fixed-contract sXxdii^gy that leads to a passive monitoring 
scheme. Under this strategy, the initial contract time and 
the compiled performance profile of the system are used to 
determine the allocation to the components. This alloca- 
tion remains constant until the termination of the problem 
solving episode. The fixed-contract policy is optimal under 
the following conditions: 

Theorem 9 Optimality of monitoring of contract algo- 
rithms, The fixed-contract monitoring strategy is optimal 
when the domain has predictable utility and the system has 
a fixed performance profile. 

Proof: This result is rather trivial since, when the domain 
has predictable utility and the system’s performance pro- 
file is fixed, utility of results at any future time can be 
determined. The initial contract time, that maximizes the 
comprehensive utility, remains the same during the compu- 
tation and no additional scheduling decision can improve 
the performance of the system. □. 

We now look at two extensions to the fixed-contract 
policy for cases with high degree of uncertainty regarding 
the quality of the results. In such cases, the initial contract 
time must be altered by an active monitoring component. 

Re-allocating residual time 

The first type of active monitoring that we analyze in- 
volves reallocation of residual time among the remaining 
anytime algorithms. Suppose that a system, composed of 
several elementary contract algorithms, is compiled into an 
optimal compound contract algorithm. Since the results of 
the elementary contract algorithms are not available during 
their execution, the only point of time where active monitor- 
ing can take place is between activations of the elementary 
components. Based on the structure of the system, an exe- 
cution order can be defined for the elementary components. 
The execution of any elementary component can be viewed 
as a transformation of a node in the graph representing the 
program from a computational node to an external “input” 
of a certain quality. This transformation is shown in Fig- 
ure 5. The quality of the new input is only known when the 


corresponding elementary component terminates. Based 
on the actual quality, the remaining time (with respect to 
the global contract) can be reallocated among the remain- 
ing computational components to yield a performance im- 
provement with respect to allocation that was based on the 
probabilistic knowledge of quality of intermediate results. 

In order to be able to allocate time optimally to each 
component, the monitor needs to access not only the per- 
formance profile of the complete system, but also the per- 
formance profiles of the residual sub-systems. The compi- 
lation problem has to be solved for each residual system. 
For example, for the system modeled by Figure 5, five per- 
formance profiles must be calculated. These performance 
profiles can be derived using the standard local compilation 
technique. The only difference is that the compiler does not 
need to store the allocation to all the components but only 
the allocation to the next component in the activation order. 


Adjusting contract time 

The second type of active monitoring for contract al- 
gorithms involves adjustments to the original contract time. 
As before, once an elementary component terminates, the 
monitor can consider its output as an input to a smaller 
residual system composed of the remaining anytime algo- 
rithms. By solving the previous equation that determines 
the contract time for the residual system, a better contract 
time can be determined that takes into account the actual 
quality of the intermediate results generated so far. 

If the elementary components are interruptible, the 
contract time can be adjusted while an elementary compo- 
nent is running. Given the quality of the results generated 
by that component and its performance profile, a new con- 
tract may be determined. In that case, the new contract may 
affect the termination time of the currently active module 
in addition to affecting the run-time of future modules. 

Monitoring interruptible algorithms 

We turn now to the problem of monitoring interrupt- 
ible anytime computation. The use of interruptible algo- 
rithms is necessary in domains whose utility function is not 
predictable (and cannot be approximated by a predictable 
utility function). Such domains are characterized by non- 
deterministic rapid change. Medical diagnosis in an inten- 
sive care unit, trading in the stock exchange market, and 
vehicle control on a highway are examples of such do- 
mains. Many possible events can change the state of such 
domains and the timing of their occurrence is essentially 
unpredictable. Consequently, accurate projection into the 
far future is very limited and the previous fixed-contract 
approach fails. Such domains require interruptible decision 
making. 
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Active monitoring using the value of computation 

Consider a system whose main decision component 
is an interruptible anytime algorithm, A. The condi- 
tional probabilistic performance profile of the algorithm 
is Qa(q^ 0 where q is the input quality and t is the time 
allocation. As before, Qa{q^ 0 a probability distribution 
and Qa{Qi ^)[Qi] denotes the probability of output quality 
Qi- 

Let 5 be the current state of the domain. Let St be 
the state of the domain at time t. And, let qt represent the 
quality of the result of the interruptible anytime algorithm at 
time t. Ua{S, t, q) represents the utility of a result of qual- 
ity q in state S at time t. The purpose of the monitor is to 
maximize the expected utility by interrupting the main de- 
cision procedure at the “right” time. Due to the high level 
of uncertainty in rapidly changing domains, the monitor 
must constantly assess the value of continued computation 
by calculating the net expected gain from continued com- 
putation given the current best results and the current state 
of the domain. This is done in the following way: 

Due to the uncertainty concerning the quality of the 
result of the algorithm, the expected utility of the result in 
a given future state St at some future time t is represented 
by: 

U'AiSut) = ^QA{q,t)[qi]UA{St,t,qi) (11) 

i 

The probability distribution of future output quality is pro- 
vided by the performance profile of the algorithm. Due to 
the uncertainty concerning the future state of the domain, 
the expected utility of the results at some future time t is 
represented by: 

= ( 12 ) 

The probability distribution of the future state of the domain 
is provided by the model of the environment. 

Finally, the condition for continuing the computation at 
time t for an additional At time units is therefore V OC > 0 
where: 

VOC = U'J^{t^At)-U'J^{t) (13) 

Similar to the case of contract algorithms, monitoring 
of interruptible systems can be simplified when it is possi- 
ble to separate the value of the results from the time used 
to generate them. In such cases, one can express the com- 
prehensive utility function, as the difference 

between two functions: 

UA{Sut,q) ^ VA{S,q)^Cost{[tc^t]) (14) 

where 1^4(5, q) is the intrinsic utility function, S is the cur- 
rent state, tc is the current time, and C ([tc? ^]) is the cost 
of the time interval [tc, t]. Under this separability assump- 
tion, the intrinsic value of allocating a certain amount of 


time t to the interruptible system (resulting in domain state 
5) is: 

K{S,t) = '^QA{q,t)[qi]VA{S,qi) (15) 

i 

Hence, the intrinsic value of allocating a certain time t in 
the current state is: 

VX{t) = Y,P{Si = S)VX{S,t) (16) 

5 

And the condition for continuing the computation at time t 
for an additional At time units is again V OC > 0 where: 

VOC = Uj((i + At) - V^{t) - Cost{[t, t -h A^]) (17) 

Theorem 10 Optimality of monitoring of interruptible al- 
gorithms. Monitoring interruptible algorithms using the 
value of computation criterion is optimal when At ^ 0 
and when the intrinsic value function is monotonically in- 
creasing and concave down and the time cost function is 
monotonically increasing and concave up. 

Proof; A function q is called concave up on a given in- 
terval I if it is continuous, piecewise differentiable, and 
Vx,y 6 / for which q'{x) and q'{y) exist, {x < y) => 
S: Q'{y))- is called concave down if Vx, y € / for 
which q\x) and g'(y) exist, (a: < y) => (g^(^) > 

Note that the assumption of monotonically increasing and 
concave down intrinsic value function is identical to the 
assumption of Dean and Wellman (See [4], Chapter 8, page 
364) that performance profiles have the property of dimin- 
ishing returns. 

Now, suppose that the current time is t\ and that 

VOC = -h At) - Uj((U) - Cost{[tr,h -h At]) < 0 

( 18 ) 

Since the intrinsic value function is concave down, it is 
guaranteed that for any future time t 2 > ti: 

VX(t2 + At) - Vj((t2) < Vj((h + At) - Vj{(ti) (19) 

Since the time cost function is concave up, it is guaranteed 
that for any future time <2 > • 

Cost([t 2 ,t 2 -f A^]) > Cost([ti,ti + At]) (20) 

Hence, it is guaranteed that for any future time ^ 2 * 

VOC = VX(t 2 + At) - V^{t 2 ) - Cost{[t 2 , t 2 + Af]) < 0 

( 21 ) 

And therefore termination at the current time is an optimal 
decision. □ 

Summary 

The monitoring problem has been examined in two 
types of domains. One type is characterized by the pre- 
dictability of utility change over time. High predictabil- 
ity of utility allows an efficient use of contract algorithms 
modified by various strategies for contract adjustment. The 
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second type of domain is characterized by rapid change and 
a high level of uncertainty. In such domains, monitoring 
must be based on the use of interruptible algorithms and the 
value of computation criterion. In domains with moderate 
change and some degree of predictability of future utility, 
one can use an integrated approach. That is, activate the 
system with an initial contract time and then, if additional 
time is available, continue to monitor it using the value of 
computation criterion. 

VI, Applications 

The advantages of compilation and monitoring of any- 
time algorithms have been demonstrated through a number 
of applications. In this section we briefly describe two such 
application. 

Mobile robot navigation 

One of the fundamental problems facing any au- 
tonomous mobile robot is the capability to plan its own 
motion using noisy sensory data. A simulated robot navi- 
gation system has been developed by composing two any- 
time modules [22], The first module, a vision algorithm, 
creates a local domain description whose quality reflects 
the probability of correctly identifying each basic position 
as being free space or an obstacle. The second module, a 
hierarchical planning algorithm, creates a path between the 
current position and the goal position. The quality of a plan 
reflects the ratio between the shortest path and the path that 
the robot generates when guided by the plan. 

Anytime hierarchical planning is based on perform- 
ing coarse-to-fine search that allows the algorithm to find 
quickly a low quality plan and then repeatedly refine it by 
replanning a segment of the plan in more detail. Hierarchi- 
cal planning is complemented by an execution architecture 
that allows for the execution of abstract plans - regardless 
of their arbitrary level of detail. This is made possible 
by using plans as advice that direct the base level execu- 
tion mechanism but does not impel a particular behavior. 
In practice, uncertainty makes it impossible to use plans 
except as a guidance mechanism. 

The conditional performance profile of the hierarchi- 
cal planner is shown in Figure 6. Each curve shows the ex- 
pected plan quality as a function of run-time for a particular 
quality of the vision module. Finally, an active monitoring 
scheme was developed to use the compiled performance 
profile of this system and the time-dependent utility func- 
tion of the robot in order to allocate time to vision and 
planning so as to maximize overall utility. 

One interesting observation of this experiment was 
that the anytime abstract planning algorithm produced high 
quality results (approx. 10% longer than the optimal path) 
with time allocation that was much shorter (approx. 30%) 
than the total run-time of a standard search algorithm. This 



Figure 6: The CPP of the anytime planner 


shows that the flexibility of anytime algorithms does not 
necessarily require a compromise in overall performance. 

Model-based diagnosis 

Model-based diagnostic methods identify defective 
components in a system by a series of tests and probes. 
Advice on informative probes and tests is given using di- 
agnostic hypotheses that are based on observations and a 
model of the system. The goal of model-based diagnosis is 
to locate the defective components using a small number of 
probes and tests. 

The General Diagnostic Engine [5] (GDE) is a basic 
method for model-based diagnostic reasoning. In GDE, 
observations and a model of a system are used in order 
to derive conflicts (A conflict is a set of components of 
which at least one has to be defective). These conflicts are 
transformed to diagnoses (A diagnosis is a set of defective 
components that might explain the deviating behavior of 
the system). The process of observing, conflict generation, 
transformation to diagnoses, and probe advice is repeated 
until the defective components are identified. GDE has 
a high computational complexity - 0(2"), where n is the 
number of components. As a result, its applicability is lim- 
ited to small-scale applications. To overcome this difficulty, 
Bakker and Bourseau have developed a model-based diag- 
nostic method, called Pragmatic Diagnostic Engine (PDE), 
whose computational complexity is O(n^). PDE is simi- 
lar to GDE, except for omitting the stage of generating all 
diagnoses before determining the best measurement-point. 
Probe advice is given on the basis of the most relevant con- 
flicts, called obvious and semi-obvious conflicts (An obvi- 
ous (semi-obvious) conflict is a conflict that is computed 
using no more than one (two) observed outputs). 
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In order to construct a real-time diagnostic system, 
Pos [13] has applied the model of compilation of anytime 
algorithms to the PDE architecture. PDE can be analyzed 
as a composition of two anytime modules. In the first 
module, a subset of all conflicts is determined. Pos im- 
plements this module by a contract form of breadth-first 
search. The second module consists of a repeated loop that 
determines which measurement should be taken next, takes 
that measurement and assimilates the new information into 
the current set of conflicts. Finally, the resulting diagnoses 
are reported. 

Two versions of the diagnostic system have been im- 
plemented: one by constructing a contract algorithm and 
the other by making the contract system interruptible using 
our reduction technique. The actual slow down factor of 
the interruptible system was approximately 2, much better 
than the worst case theoretical ratio of 4. 

VIL Conclusion 

We presented a model for intelligent robot control that 
is based on compilation and monitoring of anytime algo- 
rithms. It offers both a methodological and a practical 
contribution to the field of real-time deliberation. The main 
aspects of this contribution include: (1) simplifying the 
design and implementation of complex intelligent robots 
by separating the design of the performance components 
from the optimization of performance; (2) mechanizing the 
composition process and the monitoring process; and (3) 
constructing machine independent real-time robotic sys- 
tems that can automatically adjust resource allocation to 
yield optimal performance. 

The study of anytime computation is a promising and 
growing field in artificial intelligence and in real-time sys- 
tems. Some of the primary research directions in this field 
include; ( 1 ) Extending the scope of compilation by studying 
additional programming structures and producing a large 
library of anytime algorithms; (2) Extending the scope of 
anytime computation to include the two other aspects of 
robotic systems, namely sensing and action; and (3) Devel- 
oping additional, larger applications that demonstrate the 
benefits of this approach. The ultimate goal of this research 
is to construct robust real-time systems in which percep- 
tion, deliberation and action are governed by a collection 
of anytime algorithms. 
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Abstract 

An autonomous robot must be able to learn maps 
of its environment while accomplishing meaning- 
ful tasks. Such ^passive^ map-learning i5 difficult, 
since it cannot rely on active exploration. How- 
ever, it gives the robot more flexibility in how it 
learns about a complex novel environment. The 
primary difficulty is that incorrect maps may be 
inferred, since insufficient information may be 
available at any one time (since exploration is 
disallowed). We address this problem by allow- 
ing maps to contain errors, while correcting those 
errors when possible via a set of heuristic error- 
correction strategies. Results in a realistic sim- 
ulation with a random walk (to simulate worst- 
case explorative action) demonstrates the efficacy 
of the basic technique. 

Passive mapping may still be inefficient, so we 
incorporate intermittent exploration while main- 
taining the benefits of the passive approach. This 
is done by using predefined ^opportunity scripts* 
which direct the robot in ways that improve the 
map. Scripts are short plans which are applied 
depending on whether or not they interfere with 
other robot goals. The scripts we have imple- 
mented mostly use known techniques to improve 
mapper efficiency. Occasional use of these scripts 
over a base random walk strategy shows a speedup 
in map convergence, demonstrating the efficacy 
of intermittent exploration. Also, by using inter- 
mittent exploration, very complex worlds could be 
learned efficiently. 

1 Motivation 

While robotic manipulation technology has revolutionized 
manufacturing in recent years, the potentials of mobile 
robotics remain virtually unused. There are a number of 
reasons for this, some of them sociological, but the main 
reason is that the technology of mobile robotics is not yet 
sufficiently mature for most practical applications. One 
of the main research areas which requires development is 
map-learning. By this, we mean the automatic acquisition 
by a mobile robot of a model of its large scale environ- 
ment (floor layout, for example). This is needed, even in 
known environments, because hard-wiring the structure of 
its environment into a robot can be very costly, and it is 
difficult (if not impossible) to keep such a representation 
up to date when the environment is changed. Naturally, 
there are also cases where the structure of the environment 


cannot be known accurate in advance even to the system 
designers; automated mapping is required in those cases. 

There are many useful tasks that mobile robots can per- 
form that require the use of some sort of environmental 
map. One example is that of an office or factory courier, 
whose function is to transfer materials between areas of 
a building. This task is one of the few for which mobile 
robots are currently being used; TRC has a robotic courier 
installed in hospitals, for delivering non- critical items to 
patients upon request. Their robot contains a detailed, 
preprogrammed map of the hospital building it works in; 
the hospital environment was also engineered somewhat 
to support reliable navigation (beacons were places, for 
example). Effective map-learning would cut the costs of 
installing such a system, by both easing the initial setup, 
and by making the robots more flexible (they could be 
transferred between different buildings, say). 

Another class of tasks are those in which little or no 
a priori information about the environment exists. One 
such task, ripe for robot use, is search-cmd-rescue mis- 
sions. Rescuing people often involves going into hazardous 
environments, so it would be desirable to use robots wher- 
ever possible. Such rescue missions, whether from burning 
buildings or from caves, often require learning the struc- 
ture of the environment on the fly, so that searching is done 
efficiently (time is usually of the essence). Also, there is 
no time to explore for exploration’s sake (we will return 
to this point below). A task which is similar, though in 
a completely different domain, is planetary exploration. 
Getting men to Mars to carry out scientific exploration 
and experimentation is, at present, a difficult proposition. 
A cost-effective solution that has been proposed is to send 
robots to gather scientific data. These robots will need 
to move about in large areas of the surface to gather that 
data, and hence will need some sort of internal mapping 
to support navigation. 

l.l Passive mapping 

There are several features of the tasks described above that 
underscore important issues for robotic map learning. The 
first is that mapping does not occur in a vacuum. It is 
a part of a larger system, aiding navigational planning. 
A related point is that mapping should take place during 
normal goal-directed task execution. In fact, a ‘mapping 
pheise’ wherein the agent maps out its entire environment 
may be wholly infeasible due to the environment’s size or 
changeability. We therefore propose the view that map- 
ping should be ‘passive’, and not require controlling the 
robot’s actions. It thus functions as a static map, as far as 
planning is concerned, but the map’s utility transparently 
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improves over time. A schematic diagram of the high-level 
cognitive architecture our view implies is shown in Fig- 
ure 1. The mapper and deliberator (action selector) are 
functionally independent. Exploration enters deliberation 
through a sort of ‘back door*, via suggested exploration 
scripts (described below in Section 5). 



Figure 1: Cognitive modular divisions in the passive map- 
ping paradigm. Solid arrows indicate control flow; dashed 
arrows indicate information flow. 


Our approach may be contrasted with much previous 
research in map learning, where the mapper is viewed as 
a procedure which, after some amount of time, outputs 
a ‘correct’ map; this representation is then to be used 
for planning and reasoning. This paradigm, or variations 
thereof, is ubiquitous (eg, [3; 4]). These methods typi- 
cally require the mapper to be ‘active’ and to take control 
of the robot when uncertain information must be verified. 
This is needed due to the desire to learn a ‘complete’ and 
‘correct’ map in finite time; however, this active approach 
interferes with goal-achievement. This approach has other 
problems in the real world as well, since the real world is 
(practically) open-ended — the world to be learned cannot 
be bounded. Also, attention must be paid, in constructing 
a world representation, to the use to which it will be put. 
We address these issues below. 

1.2 Overview 

In the next section, we describe a general framework for de- 
veloping adaptive models of a robot’s environment, which 
we have used to develop our mapping system. We then 
describe the representation scheme we use for mapping, 
which incorporates both topological and metric informa- 
tion. In Section 4, we describe the algorithms used in 
the mapping system, including mapping-error diagnosis 
and correction methods. Given this passive mapper, we 
then develop methods for intermittent exploration, to im- 
prove mapping efficiency. We then describe our results in 
a robotic simulation. Section 7 reviews related previous 
work in map-learning. We close with a discussion of how 
our methods could be used in practice, and future direc- 
tions for our research. 


2 Adaptive Modeling 

2.1 Adaptive State-Space Models 

We view the mapping system as a sort of adaptive state- 
space model. There are two fundamental operations for 
which a state-space model is used: state estimation and 
state prediction. Planning uses these operations in con- 
junction with a domain theory describing the physics of 
the world. We may thus view such a model as a black box 
which outputs a state prediction given a previous state and 
a robot action, and outputs an improved state estimate 
given a state prediction and sensory input. This is essen- 
tially what is known to control theorists as an observer. It 
gives a very general conceptual framework, encompassing 
both continuous estimation methods such as the Kalman 
filter and discrete methods such as Markov chain models. 
The fundamental point we wish to stress is the use of the 
model as a black box for estimation and prediction, as far 
as the rest of the planning/control system is concerned. In- 
ternal issues of representation, and indeed whether or how 
the representation changes over time, should be largely ir- 
relevant to the rest of the system. In what follows, we 
develop an architecture for such systems, and then show 
how we have applied it to the specific problem of mobile 
robot map-learning. 

2.2 Discretizing the World 

Robots typically live in continuous state-spaces (eg, the 
plane for a mobile robot); to represent these spaces ef- 
fectively, some sort of discretization must be done. Our 
work focuses on robots designed to operate in indoors en- 
vironments (office buildings, factories, etc.). We adopt a 
technique based on Kuipers’ topological mappers [13; 14], 
which use control laws with good stability properties (ac- 
tions) to pick out distinguished ‘places’ in a continuous 
space. This allows a distinguished set of points (actually, 
small regions) to constitute ‘waypoints’. Waypoints may 
be corridor intersections, or areas near particular pieces of 
equipment; the main requirement is that they be recogniz- 
able. Waypoints can be used to represent the structure of 
the entire space, relative to the capabilities of the robot. 
As noted by Kuipers, this approach allows a representation 
to directly support navigational planning. Abstractly, the 
state space is represented as a finite state machine with 
non-deterministic transitions (which can often be assigned 
transition probabilities). 

2.3 The Architecture 

We now present an architecture for adaptive discrete state- 
space models. We first divide the system at a high level 
into an estimator an adapter (refer to Figure 2). The 
core of the estimator is the loop between projection and 
matching. The projector predicts new states given old 
state estimates and control inputs. There may be mul- 
tiple state estimates in the system, which we call tracks 
(each tracks a possible true state). The matcher decides 
which states are consistent with each predicted estimate, 
given the current perceptual input. Thus far, we have 
the standard recursive estimation framework. Now, the 
spaces we are interested in are both very large (thousands 
of states) and relatively unstructured (without even ap- 
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proximate closed form estimators). Hence we need index- 
ing, to find likely candidate states to feed to the matcher. 
Further, in different situations, different matching meth- 
ods will be appropriate (for error recovery, for example). 
Therefore, the matcher is divided into a general match- 
ing engine, dependent only on the representation language, 
and a task-dependent set of matching methods {matchers). 
The matching engine also provides a method of conflict 
resolution to determine which matchers are applicable in 
given circumstances. 



Figure 2: Adaptive state-space model architecture. 


The adapter consists of an updating module and a re- 
structuring module. The updater adjusts the parameters 
of states in the representation, for example, the position 
of a known waypoint. It also can monotonically change 
the topology of the represented state space by, for exam- 
ple, adding new state transitions. The exact type of up- 
dating that is performed depends on the matches found 
by the matcher; different matchers may (and usually will) 
have different associated update methods. A deeper sort of 
adaptation is restructuring, which uses information about 
the large-scale structure of the known state-space as well as 
integration of observations over longer terms to adjust the 
structure of the state-space. This may involve splitting or 
coalescing states, adjusting hierarchical relationships, and 
so on. Like matching, and for the same reason, we divide 
this module into a general restructuring engine and a task- 
dependent set of restructurers. To alleviate the problem of 
searching aimlessly in the state space for restructuring to 
do, it can be advantageous for the updater to inform the 


restructuring module when it performs an update, since 
a change made by the updater to the representation may 
trigger a restructuring operation. 

3 Diktiometric Representation 

The first step in specializing the architecture described 
above to robot mapping is the specification of a represen- 
tation language for our maps. The straightforward method 
using the discrete state-space approach amounts to topo- 
logical mapping, the method suggested by Kuipers in [13]. 
The world is represented as a graph of waypoint nodes, 
labeled with local perceptual information. Transitions are 
labeled with robot control routines which constitute the ac- 
tions that move the robot between waypoints ([9] describes 
how these routines are derived). We extend this notion 
to directly take into account geometric knowledge as well. 
Diktiometric^ representation explicitly represents geomet- 
ric relations between waypoints. A diktiometric represen- 
tation (a diktiometry) consists of two graphs, a path graph 
and a reference graph (see Figure 3). Path graph nodes 
represent waypoints, and arcs represent action transitions. 
Nodes in the reference graph represent local coordinate 
frames, with links giving known geometric relations. The 
two graphs are connected by reference linkSy giving the 
positions of waypoints with respect to particular local ref- 
erence frames. 



Figure 3: A schematic picture of a diktiometry, where 
doorways are taken as ‘waypoints’. Action links are solid, 
reference links are dashed. 


3.1 Uncertain Geometry 

We represent uncertain geometric relations between way- 
points relative to local reference frames with limited cov- 
erage. This ensures that, provided the robot knows which 
reference frame’s domain it is in, relative uncertainly re- 
mains bounded. This is so regardless of the method used 
to represent uncertainty. In the multi-dimensional Gaus- 
sian distribution approach [20], the local reference frame 

^From the Greek Bikwoi/ meaning ‘network’, and pirpou 
meaning ‘measurement’. Diktiometric representations repre- 
sent the world as a network of waypoints with relative positions. 
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approach amounts to maintaining a set of smaller covari- 
ance matrices (with smallish eigenvalues) instead of a sin- 
gle large covariance matrix for the entire system (which will 
necessarily have some large eigenvalues, hence large uncer- 
tainty, even between nearby points). The local reference 
frames are related to each other by small covariance ma- 
trices; a position in any frame can be related to any other 
by appropriate composition. The local method is also a 
performance win when locality can be established (though 
doing so may incur extra costs). Rather than using such a 
statistical representation for odometric uncertainty, how- 
ever, we advocate the use of bounding intervals. 

The primary reason that Gaussian noise models are so 
widely used is the Kalman filter, and its non-linear cousin, 
the extended Kalman filter. These are the optimal lin- 
ear recursive updating procedures, given a truly Gaussian 
noise model. However, when this model is not correct, 
the Kalman filter can be suboptimal and can even diverge. 
We therefore prefer a non-distributional approach. Rather 
than representing a probability distribution on the true 
value of a measurement, we give bounds on the possible 
true values. These bounds give us a set of possible val- 
ues within which true must lie. Projection and updating 
can be done easily and efficiently using interval arithmetic 
[1]. Our contention is that odometry is subject to so many 
unmodellable sources of noise (slippage, bumps, voltage 
fluctuations, etc.) that the best that can be hoped for is 
reasonable bounds on relative position. In a typical of- 
fice environment, with current equipment, odometric error 
over distances of several meters is about 10% of distance 
traveled^. This gives reasonably tight bounds on relative 
motion; use of sensory feedback (eg, visual motion analy- 
sis) may also help. 

4 The Mapping System 

The adaptive state-space architecture described above, 
combined with diktiometric representation, provides a 
framework for designing a robot mapping system which 
supports the flexible navigation planning tasks we wish to 
address. This section describes in more detail how this 
works. 

4.1 Indexing waypoints 

For some applications using the adaptive state-space model 
paradigm, indexing is trivial, due to there being a relatively 
small number of states. If, however, we wish to learn dik- 
tiometries of large-scale spaces in an open-ended fashion, 
we must deal with thousands of waypoints (at least). It 
is simply infeasible to examine the entire diktiometry for 
matching waypoints to extend a track. On the other hand, 
there is no fool-proof way of generating only the most ‘sim- 
ilar’ candidates that we know of. Hence, we developed 
heuristic indexing methods that should work well in prac- 
tice, There are three categories of indexing we examine: 
expectaiionaly geometriCy and perceptual 

Each of the geometric and perceptual indexing modules 
produces a stream of sets of candidate waypoints. Each 
set in a stream contains better candidates than those fol- 
lowing it, while members of a set are assumed to all be 

^Jonathan Connell, personal communication. 


equivalently good. The indexing methods are integrated 
by running the modules in parallel and combining the can- 
didate sets that come out. 

Expectations The simplest form of indexing uses the 
path graph to predict the expected state of the robot after 
executing the current action. Given a particular current 
waypoint, the expectational candidates are those which 
are predicted by the last action taken from that waypoint. 
Naturally, there may be more than one, since actions can 
be non-deterministic. Expectational indexing produces 
one candidate set, used together with the first sets pro- 
duced by other indexing. 

Geometric Indexing The second sort of indexing is ge- 
ometric indexing, based on waypoint positions. The basic 
idea is to find waypoints whose position relative to a track’s 
is consistent with the last position change. We will discuss 
two indexing methods which use the reference graph to find 
waypoints likely to be near the current (projected) robot 
position. 

The simplest method is to use a depth-limited search 
through the reference graph, starting from the current 
track’s frame. If the new position estimate may be con- 
sistent with one of a frame’s waypoints, the waypoints are 
checked individually for consistency and candidates sug- 
gested. This method assumes that the area has been rea- 
sonably well explored, so that the robot moves between 
frame regions known to be neighbors. This implies that 
each ply of the search is less plausible (as the search gets 
farther from the source), giving, as desired, a stream of 
candidate sets. On the other hand, the assumption of 
nearby frames giving correct candidates will not always be 
correct, particularly in the early stages of learning (how- 
ever, more frames may be searched then, since there is less 
to search). On the other hand, the farther a frame is in the 
reference graph, the less useful information (in general) it 
gives for constraining the robot’s position. 

Perceptual Indexing The objective of perceptual in- 
dexing is to quickly find those stored percepts that are 
most similar to the current one. Furthermore, since the 
robot will never be in quite the same configuration twice, 
the indexing method should be robust with respect to small 
changes in position and orientation. We have developed an 
image-based method for waypoint recognition, using the 
notion of image signatures. An image signature is an array 
of values, each computed by a measurement function from 
a subset of the image (the image is tesselated). As demon- 
strated in [8], signatures can be matched to each other 
for fairly reliable recognition. The problem here is how to 
index this database so that similar signatures taken at dif- 
ferent orientations will be found quickly. This can be done 
by indexing the signatures by their columns, since if two 
signatures are taken at different rotations, they will match 
at a horizontal offset. Hence, an input signature’s columns 
are used to index the columns close to them, marking each 
signature found at the offset implied by the column match. 
When enough of a database signature’s columns have been 
marked, the signature’s waypoint is suggested as a match 
candidate. 

Column indexing can easily be implemented as a k-d 
tree [18]; an iteratively growing hypercube search is used 
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to search progressively further and further from the input 
column. In this way, good candidate waypoints can be 
found based on perceptual cues, even if they are geometri- 
cally distant. A similar approach could also be taken using 
3D estimates of the positions of visual features. Using the 
method of Atiya and Hager [2], feature triples can be rep- 
resented by a set of 6 parameters describing a triangle. If 
triples of features going around the robot’s viewpoint are 
stored thusly, a similar k-d tree approach can be used for 
indexing. Using this method, more sophisticated methods 
of 3D recognition and position registration can be done 
as well, given the required perceptual capabilities (eg, a 
system such as [21]). 

4.2 Matching and updating 

State identification in diktiometric mapping amounts to 
matching the robot’s projected position and sensory in- 
puts with candidate waypoints (generated as above). If we 
could guarantee that no mapping errors would ever occur, 
then the matching problem amounts to little more than 
filtering possibilities for consistency. However, as noted 
above, this is never the case, and so error diagnosis and 
correction must continually be performed. One way in 
which this is done is through the use of multiple match- 
ers, each indicating a particular state of affairs, including 
mapping errors. Each matcher consists of a match test 
which compares the projected robot state with waypoints 
in a candidate set, and an update method which is applied 
to waypoints that are determined to match. Resolving be- 
tween multiple applicable matchers is done by arranging 
them in a partial order; the maximal applicable matchers 
are used. This preference relation is constructed so that 
more reasonable decisions take precedence over less likely 
decisions (posit as few and as plausible errors as possible). 
If no matchers apply, a new waypoint node is created. 

4.2.1 Matchers 

There are four main matchers that we currently use. 
Two that correspond to normal extension of the map are 
Continue and Link. Continue matches a waypoint that 
was expected with consistent position estimate and view; 
the waypoint’s position estimate and view set are updated. 
Link matches a waypoint with consistent position estimate 
and view which was unexpected, and so also adds a new 
action link to the path graph. Two other matchers correct 
for positional inconsistency caused by incorrect waypoint 
identification or odometric error. To deal with the case 
where a waypoint’s position interval does not contain its 
true position (due to update with an outlier), the system 
keeps track of the waypoint’s nominal envelope, the least 
bounding interval of the estimates used for updating the 
waypoint’s position. This is asymptotically guaranteed to 
contain the true position. Thus, E-Match matches a way- 
point such that the projected robot position is consistent 
with the waypoint’s nominal envelope, indicating a possi- 
ble waypoint position inconsistency. The waypoint’s posi- 
tion estimate is the grown to contain the nominal envelope, 
presumably consistent. To deal with a track drifting from 
true, N-Match allows a match with a waypoint near the 
track’s projected position; the track is then ‘snapped’ to 
the waypoint. A fifth, ‘pseudo-match’ is used for waypoint 
creation; when it is chosen to be used, a new waypoint is 


added with a track’s projected state. 

4.2.2 Dynamic priorities 

A static priority scheme for matchers, while easy to im- 
plement, will seldom really be appropriate. For exam- 
ple, Continue should only be preferred to Link in well- 
explored areas, otherwise it is no better (since few links 
are known). We thus propose a dynamic scheme for pri- 
oritizing matchers, using estimates of the quality of the 
mapper’s knowledge. This is done using local grid-based 
methods to maintain estimates of how well areas have been 
visited or traversed, as well as confidence in each individ- 
ual track, based on its recent history. We can thus express 
preferences as the following: 

• Prefer Continue to Link if the region just traversed 
has been well traversed (hence probably mapped) in 
the past. 

• Prefer E- Match or waypoint creation to N- Match 
when a track has high confidence, and the converse 
when a track has low confidence. 

In a similar fashion, other ideas of when peirticular types 
of matching are more appropriate can be expressed. This 
framework of a dynamic partial order controlled by meta- 
knowledge determined preferences seems both flexible 
enough to encode the required diagnostic knowledge, while 
providing a simple and modular mode of expression. 

4.3 Transients 

The first function of the restructurer in our system is to 
deal with map errors due to transients, non-existent states 
and transitions hallucinated due to nonreproducable con- 
ditions. Of course, if a hallucination is consistent, it may 
(usually) be considered real enough, as far as the robot is 
concerned. The only way to detect these transients, with- 
out taking over control of the robot, is to maintain statis- 
tics of when each link is thought to have been traversed and 
each waypoint is thought to have been visited. If these are 
compared with the frequency that the link or waypoint 
was expected, transients will be distinguished by a very 
low frequency of occurrence. The offending link/waypoint 
is then removed from the diktiometry. This also helps deal 
with (slowly) changing environments, 

4.4 Waypoint restructuring 

A deeper sort of error that cannot be discovered by us- 
ing imprecise matching (hence requiring restructuring) is 
structural inconsistency. Incorrect waypoint identification 
can lead to either multiple waypoints in the world be- 
ing represented by a single waypoint node (polytopy) or 
the converse, a single real-world waypoint represented by 
multiple nodes (monotopy). These sorts of errors can be 
dealt with by the system’s restructurers. The concept be- 
hind these restructurers is that while one observation of 
a waypoint may not be sufficient to determine its iden- 
tity, integration of many observations can yield statisti- 
cally significant evidence of environmental structure. We 
deal below with two restructurers, one for splitting to deal 
with polytopy, and one for merging to deal with mono- 
topy. To a great extent these two are inverses, and we 
apply similar methods for both, as summarized in Table 1. 
We divide the constraints applied into three categories: 


814 


Constraints 

Splitting 

Merging 

Geometric 

Multimodal dis- 
tribution of po- 
sition observan 
tions for a way- 
point node 

Unimodal distri- 
bution of posi- 
tion observations 
for two different 
waypoint nodes 

Local 

Separable corre- 
lated sets of 
input-output ac- 
tion link pairs 

Nearly identical 
output link sets 

Non-local 


Multiple 

mergable track 
histories 


Table 1: Constraints for waypoint restructuring. 


geometric constraints on the positions of waypoints, local 
path constraints about local functional relationships in the 
path graph, and non-local path constraints applied to larger 
portions of the path graph. 

Each of these restructurers can, in principle, operate 
independently. To integrate them, we propose to use a 
‘veto-based* strategy. Each restructuring method depends 
on certain thresholds, giving the desired confidence in a 
diagnosis of mono/polytopy. If we give each two thresh- 
olds, such that when the higher is satisfied we say that a 
diagnosis is proposed^ and when the lower is not satisfied 
the diagnosis is rejected^ the three strategies can be used 
in tandem as follows. If a diagnosis of either monotopy or 
polytopy is proposed by at least one restructurer, and is 
not rejected by any, we accept the diagnosis. This provides 
a sensible and modular integration of multiple constraints 
for diktiometry restructuring. 

Geometric constraints The essential insight here is 
that by making the reasonable assumption that all way- 
points are at least some minimum distance apart (call it 
^sep)j we can discover when sets of position observations 
(from odometry) come from one or many waypoints. If 
then make the further weak statistical assumption that 
the distribution of position estimates for a waypoint is uni- 
modal, we can use a multimodality test to test for struc- 
tural inconsistency. If a single waypoint node represents 
two re 2 il waypoints, we would expect the distribution of 
position estimates matched to the node to be bimodal. 
Similarly, if two nodes correspond to one waypoint, their 
combined observation set should be unimodal. This will 
not always work, so we use other constraints as well. 

Local path constraints The next sort of restructuring 
we consider uses a functional kind of constraint. The idea 
is that each waypoint has a consistent, if stochastic, input- 
output behavior (eflfects of performing actions). This being 
the case, polytopy is indicated by inconsistent effects at one 
waypoint, and monotopy by two waypoints with (nearly) 
identical effects. This is similar to Chrisman*s approach 
to the closely related perceptual aliasing problem found in 
reinforcement learning [6]. Here, the merging method is 
simpler — if two waypoints have nearly identical incoming 
and outgoing action link sets (greater than a large fraction 
go to the same waypoint via equivalent actions), then the 
waypoints should be merged. The waypoints also should 


have been visited often enough to ensure that the known 
links are representative. Splitting requires examining how 
well incoming links predict outgoing links. If the incoming 
links can be partitioned into two sets such that the sets* 
‘images* (the sets of outgoing links taken after coming in 
on each link in a set) are (nearly) disjoint, then a split is 
indicated. This heuristic detects cases where a waypoint 
node does not adequately represent a functional state of 
the robot. By assumption, each waypoint corresponds to 
a single functional state (though the converse need not 
hold). 

Non-local path constraints A third method uses non- 
local path constraints for determining restructuring. Cur- 
rently, this only applies to merging. One problem with 
the local path approach to diagnosing monotopy is merg- 
ing deadlock^ where two different waypoints both need to 
be merged, but each inhibits the other being merged since 
local information does not suffice (the link sets are not 
identical). Hence what is needed is a sort of sub-graph 
isomorphism. Unfortunately, this is intractable, so we use 
a heuristic approximation. As the robot travels through 
the world, each track maintains a history of the waypoints 
it matched to. As well, each waypoint P in a history is 
associated with other waypoints that (a) were considered 
as matches but rejected, and (b) could be mergable with P 
(ie, their position estimates are consistent). An aurbitrary 
length limit is placed upon histories to prevent them from 
growing without bound. When a track matches a way- 
point, it deposits copies of its histories at the waypoint. 
When two waypoints have enough histories consistent with 
each other, the histories are used to ‘sew up* a portion of 
the path graph. This may introduce spurious merges, but 
with conservative threshold choices, it can work in concert 
with the two methods described above to provide effective 
restructuring. 

5 Opportunistic Exploration 

Even though passive mapping, as described above, is im- 
portant so that the robot can pursue its goals while learn- 
ing, it can also be inefficient. This problem can be ame- 
liorated somewhat, without sacrificing the benefits of pas- 
sivity, by allowing the mapper to advise the deliberator of 
actions that the mapper would find useful. These can be 
treated by the deliberator as goals to satisfy when feasi- 
ble; failure of a mapper goal does not invalidate a plan. 
So, if the robot decides it has an extra ten minutes before 
it has to deliver a package, it can take a short trip down a 
side corridor to see where it leads. The main point is that 
ultimate control lies inside the deliberator, which has the 
responsibility of balancing the utility of the various goals 
it must achieve. 

We implement this idea by using opportunity scripts — 
short, stereotyped sequences of actions designed to help 
mapping in particular situations (an early version of this 
is described in [11]). These are suggested to the deliber- 
ator by an opportunity checker^ which examines the cur- 
rent mapper state to determine if any scripts are applica- 
ble. Those which are, are sent to the deliberator where 
they may get executed. Even if they aren’t, there is no 
great loss, since the mapping system *s correctness does 
not depend on the actions the robot takes. There are two 
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If: 


Do; 


• Uncertainty otlhVlast Aposjtion high 

• Uncertainty of all track positions high 

• Didn't just retrace step 

• Try to get a fixation on the last waypoint 
visited; 

• If found, go there, otherwise exit; 

• Try to return, if possible. 

Retrace step 


If: 


I Do- 1 


• I here is a nearby waypoint whose uncer- 
tainty is high 

• Only one track is current 

• I ry to get a fixatio^n on the uncertain way> 
point based on its relative position; 

• If fixation found, go there, otherwise exit. 

Head for uncertainty 


sorts of opportunities that can be dealt with in this way — 
exploration and experimentation. 

There are two reasons to use opportunity scripts to aid 
mapping. One is that a robot will not naturally explore the 
world efficiently, since its actions are determined by other 
considerations. The other is that mapper-directed activ- 
ity may improve the reliability of mapping decisions and 
reduce the introduction of errors into the map. We have 
developed and tested some heuristic opportunity scripts to 
thus aid mapping; they are described below. 



Do: 


• I here is only one track, with high uncer- 
tainty 

• There is nearby waypoint with low uncer- 
tainty 

• Try to go to that waypoint. 

Head l^R certainty 


sitional estimate. This can only be reasonably tried, of 
course, if the robot’s current waypoint is unambiguous. 


5.1 Exploration scripts 

The essential idea of using opportunity scripts to improve 
mapping is that they can be performed whenever a high- 
level decision process determines that other goals can be 
put off for a short while. This implies, first of all, that these 
scripts must apply in general circumstances, as the mapper 
has no control over when they will be called upon. Sec- 
ondly, the scripts must be fairly limited in duration, so that 
they can do their business of improving mapping and then 
let the robot get back to its high-level goals. Scripts have 
two components, an application test which determines if 
the script is relevant, and a (loop-less) procedure which is 
executed if the script is applied. Scripts use the mapper’s 
data structures, in particular the set of current tracks and 
the map itself. 

We have investigated a number of exploration scripts. 
Some seek to reduce positional uncertainty. Others at- 
tempt to find new waypoints and action links. Scripts are 
also used to reduce ambiguity in the map or in the robot’s 
position estimate. Some probe map waypoints which seem 
likely to not really exist. Keep in mind that all these scripts 
do is direct the behavior of the robot, indirectly focusing 
the attention of mapping system. 

Retrace step : 

One important source of error and ambiguity in a map is 
positional uncertainty. When the robot performs an action 
with a very uncertain estimate of relative position, and the 
robot’s a posteriori position estimate (after matching to 
its map) is also particularly uncertain, a simple and useful 
heuristic for reducing uncertainty both in the map and in 
the current position estimate is to retrace the last step 
taken. That is, the robot tries to return to the waypoint 
it just came from; if it manages that, it tries to get back 
to where it started. 

Head for uncertainty : 

A more generally applicable heuristic for reducing map 
uncertainty is to simply head for nearby waypoints with 
large positional uncertainty, under the assumption that 
if they are reached, new constraints will improve the po- 


Head for certainty : 

The converse of the last script is useful when the robot’s 
positional uncertainty gets too high, and that is to head 
for a nearby waypoint whose position is known very pre- 
cisely. If the robot reaches and recognizes the waypoint, 
the robot’s position will then also be known more precisely, 
improving further mapping. 

Disambiguate tracks : 

It will often occur that the robot’s estimate of its cur- 
rent position will be ambiguous. If there are thus multiple 
current tracks, a good way to distinguish between them is 
to try to perform an action with different results for the 
different possible waypoints the robot is at. This will usu- 
ally result in the incorrect track becoming inconsistent and 
thus dropped. 

Probe ambiguous action : 

One kind of map ambiguity is when a waypoint has mul- 
tiple action links coming from it labeled with the same ac- 
tion. While this ambiguity may be inherent, it may also be 
that one of the links is due to a transient; hence, further 
examination is warranted. This is achieved by attempting 
to perform such an ambiguous action — this will tend to 
speed up elision of any transients, and just maintain the 
real action links. 

Probe splittage : 

The kind of map ambiguity we consider for now is where 
two identical links from different waypoints end at the 
same waypoint and there is reason to believe that only 
one is real. This happens when a waypoint is split (see 


If: 

• There are multiple current tracks. 


• Choose an action link whose destination is 
known reachable from some, but not all. of 

Do: 

the tracks’ waypoints; 

• Try to go to that link's destination way- 
point. 


1 Disambiguate tracks 
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If: 

• 1 here is a sinsle current track. 

• There are multiple links from the current 

waypoint labeled with the same action. 

Do: 

• Perform that action. 

Probe ambiguous action | 


If: 

• There is a single current track, 

• The current waypoint was recently split off 
from another waypoint. 


• Choose an action link which both waypoints 

|Do:| 

have in common, 


• Try to traverse it. 

1 Probe splittage | 


above). Since both waypoints resulting from a split have 
copies of the same action links, many of those links will 
be invalid. Hence, when the robot is at a waypoint which 
resulted from a recent split, the Probe splittage script 
tries to perform an action that has (in the map) identi- 
cal consequences in the two split-off waypoints. This will 
accelerate elision of the invalid action links. 



|p°-l 


• There is only one current 

• The current waypoint has a neighbor which 
has been visited less than a threshold num- 
ber of times. 

• Choose such a rarely visited neighbor, 

• Try to go to there. 

Head for rare waypoints 


tion required before it can actually be performed and a set 
of exploration scripts to help gather that information. For 
example, before performing a merge, a script may be used 
to probe the distinctness of the two waypoints. This infor- 
mation is associated with the waypoint (s) involved, so that 
when the robot returns, the scripts are then suggested to 
the deliberator for execution. Again, as with exploration 
scripts, the particulars of the experimentation scripts used 
depend on the types of updating and restructuring done. 
We are currently working on developing a set of experi- 
mentation scripts for our system, but they have not yet 
been implemented. 

6 Results 


Head for unexplored area : 

Most of the previous scripts have dealt with improving 
the system *s knowledge of waypoints already in its map. 
Finding new, unknown waypoints in an efficient manner, 
however, would also be useful, so that the world may be 
more quickly explored. We thus try to head for an area of 
the world which has not likely been visited by the robot 
before. To decide that this holds of some area, each lo- 
cal reference frame has associated with it a coverage grid^ 
which tesselates the area about the frame into a coarse 
grid, and keeps track of an estimated certainty that the 
robot has visited each grid cell. Then, an area is deemed 
to be unexplored if the likelihood of part of it having been 
visited in the past is sufficiently low. Thus, HEAD FOR 
unexplored area looks in the vicinity of the current 
waypoint for a nearby area which looks unexplored, and if 
one is found, attempts to head in its direction. 

Head for rare waypoints : 

Recall that trsmsient environmental features are even- 
tually elided by the mapping system by noting their fre- 
quency of apprehension. This process can be speeded up if 
the robot tries to reach waypoints which look likely to be 
tr 2 msients. Since transients, by their very nature, are not 
encountered often, it is likely that a waypoint that has not 
been visited often is transient. If so, then repeatedly trying 
to reach the waypoint will cause the system to notice that 
it is not encountered as expected, and so it will eventually 
be elided. If it is not a transient, then the mapper will just 
gain a bit more information about the waypoint. 

5 . 2 Exp eriment at ion 

The main type of experimentation that can be done in our 
framework delays diktiometry adaptation until more in- 
formation has come in. Whenever a radical update or re- 
structuring would normally be performed, a note is made 
of the operation to be performed, along with the informa- 


We present here the results of some experiments we have 
performed in simulation on the system described above. 
The simulator is described in [10]; space precludes a full 
discussion here. Briefly, waypoints are determined by con- 
figurations of walls and image signatures are simulated by 
noisy samples of wall ‘color’. Wherever possible, worst- 
case assumptions were made with respect to sensor and 
effector noise; all such parameters are adjustable. The 
performance of the mapper was quantitatively evaluated 
by measuring a posteriori position error — the error inher- 
ent in allowing the robot to rely on the map to determine 
its expected position after each move. We measure this 
by calculating the sum-of-squared- distance (SSD) between 
the robots actual relative motion and predicted relative 
motion for each track after a move. If the system is ef- 
fective at mapping, we expect the average SSD per move 
to asymptotically converge to a small constant. There are 
other useful performance metrics discussed in [9], but the 
different methods give qualitatively similar results — space 
does not permit inclusion here. 

Figure 4(a) shows a typical small environment used for 
evaluation. The world was designed to be confusing; every 
waypoint looks the same as every other. For each run, the 
robot was controlled by an essentially random walk, while 
the mapper ran in the background. A move is defined as 
a sequence of actions ending with the robot in a distin- 
guished waypoint (here, corners or doorways). Each run 
was 700 moves; a good maps were generally learned within 
300. As Figure 4(b) shows, SSD position error starts out 
high, but quickly begins to converge to a low asymptote 
(non-zero due to inherent odometric error). This demon- 
strates the effectiveness of the system in a confusing envi- 
ronment, even with no mapper control over the robot. 

The use of exploration scripts were tested by randomly 
executing them when applicable (generally 30% of the 
time). Comparative results are shown in Figure 4(c), which 
shows a significant improvement in mapper performance 
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Figure 4: (a) A simple, but confusing, robot environment 
(left) and a typical map learned (right), (b) SSD position 
error (y-axis) vs. number of waypoint visits (x-axis), av- 
eraged over 10 runs, (c) Improvement (in another, larger, 
world) by occasionally using exploration scripts (solid) vs. 
pure random walk (dashed). 


when exploration scripts are used. We expect a further 
improvement when we have taken into account the con- 
flict resolution between different scripts; we are currently 
investigating how to compare scripts so that the most use- 
ful gets executed. This question is connected to the larger 
question of assigning utility to exploration and experimen- 
tation, which is important in terms of the deliberator’s 
tradeoffs between goal-achievement and mapper script ex- 
ecution. This will form an important focus of our work in 
the nestr future. 

7 Related Work 

Much research on navigational mapping deals with prob- 
lems of local metric representation (eg., [3; 7; 2]), which 
is generally unsuitable in large-scale spaces. Kuipers and 
Byun first develop the notion of a topological place graph 
based on ‘distinctive’ locations [14]. However, while they 
go to some length to avoid error creeping into the map by 
using active experimentation, there is no provision for er- 
ror correction. Levitt ei ah also utilize a topological map 
which avoids accumulation of navigation error, by using 
local reference frames based on landmarks [15]. However, 
their system appears to depend heavily on reliable land- 
mark acquisition. Miller and Slack use information gener- 
ated for reactive local navigation to build rough geometri- 
cal maps of rocky terrain [17]. Their maps are notable in 
that they can directly be used for reactive navigation. 

Basye ei ai develop a probabilistic theoretical frame- 
work for map learning [4]. They probabilistically elimi- 
nate errors in the learned map by using active exploration, 
assuming limited directional certainty and globally recog- 
nizable places. However, their methods use very simple 
models of perception and action and do not use the rich 
geometrical and perceptual structure available. Hence they 
are forced to use strongly active strategies to learn maps 
reliably. 

Our methods for dealing with mapping errors can also 
be incorporated into existing mapping systems with mini- 
mal modification. Virtually any system that uses a place 




Figure 5: A complicated world and a learned map, learned 
after 3000 moves, with exploration scripts. 


graph representation can be reformulated in our terms. 
Specifically, Sarachik’s system for visual navigation [19] is 
particularly apposite. Her system visually recognizes room 
shapes and finds doors, linking them together in a place 
graph. A room’s perceived shape can be used as its percep- 
tual description; its position can be described in a locally 
determined reference frame. Our error correction machin- 
ery could then be applied virtually as is. 

Mataric [16] uses constraints derived from knowledge of 
the robot’s underlying behavior to derive a topological map 
based on linear graph segments. Yeap [22] describes a hi- 
erarchical topological map, with place nodes described by 
2D geometric models. Braunegg [5] develops a similar style 
of map, where rooms are characterised by the geometric 
arrangement of vertical edges, measured by stereo vision. 
Kriegman [12] describes a method for visually instantiating 
generic models of the robot’s surroundings, such as hall- 
ways, bringing top-down constraints to bear on geometric 
interpretation. 

8 Discussion 

In this paper, we have shown how map-learning can 
be organized around the principle of passive mapping. 
Passive mapping involves two important components: 
error-tolerant representations and explicit error- correction 
strategies. In addition, exploration can be helpful, but 
should be optional. This can be implemented through the 
use of exploration scripts, as described above. 

Our mapping system, as we have described, is a pas- 
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sive mapping system designed for indoor environments. In 
the future, we want to extend this work to other types of 
environments, such as city streets or forests. At present, 
though, this work is directly applicable to some real-world 
mapping tasks, such as those involved in inter-office de- 
livery or some search- and-rescue tasks. Implementation of 
complete systems that could be deployed for such tasks is 
not yet feasible; it awaits other developments in effective 
perception and robust action. 
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ABSTRACT: This paper extends a local sensor 
based planning method for hyper- redundant robot 
mechanisms. In a previous paper, sensor feedback 
control methods are considered. A highly localized 
sensor feedback method for hyper-redundand manip- 
ulators is termed, partial shape modification (PSM). 
A PSM utilizes a mechanism’s hyper-redundancy to 
enable both local obstacle avoidance and end-effector 
placement in real time. This paper considers the sit- 
uation in which the limits of a PSM is violated. In 
other words, what does the robot do when it can not 
only locally adapt to the environment. Local sensor 
based planning has been implemented on a thirty de- 
gree of freedom hyper-redundant manipulator which 
has eleven ultrasonic distance measurement sensors 
and twenty infrared proximity sensors. The robot is 
controlled by a real time control computer which com- 
municates with sensors through an innovative sensor 
bus architecture. Experimental results obtained us- 
ing this test bed show the efficacy of the proposed 
method. 


1. Introduction 

This paper extends experimental results from [TCB] 
in the area of local sensor based planning for hyper- 
redundant robot manipulators. Recall from [ChB90b] 
that a “hyper-redundant” manipulator is a kinemat- 
ically redundant manipulator in which the degree of 
redundancy is very large or infinite. Such robots are 
analogous in morphology to tentacles, an elephant’s 
trunk, a monkey’s tail or a snake. “Sensor based 
planning” incorporates sensory information into some 
stage of a robotic motion planning, whether it be 
navigation, locomotion, grasping, etc. “Local Sen- 
sor Based Planning” fine tunes a robot’s plan, based 
on sensor information. Local sensor based planning 
is useful when: (1) the robot only has a coarse knowl- 
edge of the world because of limited memory; (2) 
the robot’s world model contains inaccuracies; and 
(3) the world is subject to unexpected occurrences or 
rapidly changing situations. These situations can be 
overcome with local sensor based planning strategies. 

Due to their many degrees of freedom, hyper- 
redundant robots are potentially superior for opera- 
tions in highly constrained and unusual environments 
encountered in applications such as inspection of nu- 
clear reactor cores, chemical sampling of buried toxic 
waste, and medical endoscopy. Hyper-redundant 
robots can also be used as tentacle-like grasping de- 
vices for capturing and manipulating floating satel- 
lites [ChB90c] or to enable complex “whole arm ma- 


nipulation.” Mobile hyper- redundant robots also of- 
fer novel means for locomotion [ChB91a, ChB93a, 
ChB93b, ChB93c] in complex environments. 

The above mentioned applications are characterized 
by environments which are difficult to precisely model 
and which are time varying. Thus, local sensor-based 
motion planning schemes are vital to the realistic de- 
ployment of hyper-redundant robots in these applica- 
tions. While hyper-redundant robots have many ad- 
vantages for the above described applications, they 
have one disadvantage. Since hyper-redundant ma- 
nipulators have a large number of joints or actuators, 
small joint displacement errors can accumulate to rea- 
sonably large errors in the position of the tip relative 
to the base. Thus, the effective accuracy of hyper- 
redundant robots could be improved by distributing 
sensors along their length and employing sensor based 
planning schemes. 

Thus, local sensor based planning can be used to: 
(1) account for spatial uncertainty or inaccuracies in 
the world model used by a “global” planner to con- 
struct a robot plan; (2) increase the effective accuracy 
of a hyper-redundant robot mechanism; and (3) lo- 
cally adapt to rapid environmental variations, such 
as moving obstacles, that can not be easily or rapidly 
handled by a global planner. 

The local sensor based planning algorithm of hyper- 
redundant manipulators is based on the analysis 
found in [ChB90b, ChB91a, ChB92a, ChB92c, Ch]. 
This work has been demonstrated on an actual ex- 
tensible 30 degree-of-freedom hyper-redundant robot 
system. A hyper- redundant manipulator which can 
vary its length, within the limitations of its actuators, 
is termed “extensible.” A more detailed account of 
this mechanism and its capabilities can be found in 
[ChB92b]. 

Robotic motion planning has been an important area 
of research. Since the introduction of configuration 
space methods [LoWes], several other theories have 
been published, some of which are summarized in 
[Sh,Lat]. However, these methods plan from a per- 
fect model of the world, which is normally unavail- 
able to a real robot. More recently, methods have 
been developed in which the robot explores the envi- 
ronment to gather information for the planning pro- 
cess [CaLi]. These approaches assume that the sen- 
sors provide perfect information about the environ- 
ment. There has been little work devoted explicitly 
to motion planning for robot snakes. One approach is 
based on the construction of tunnels through the ob- 
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Stacie field, through which the manipulator “slithers” 
[ChB90a, Ch]. In another work, sensor based plan- 
ning for highly redundant robots is based on a tacirix 
[ReLu]. However, this work assumes that there are 
perfect sensors on the robot; nor has it been imple- 
mented on a real robot. Hirose [HiU] implemented 
an “active cord” mechanism, which used tactile sen- 
sors to guide its motion. A previous paper [TCB] 
presents preliminary strategies for local sensor based 
planning, which are implementable in real time, can 
employ a variety of sensors, and exploit the benefits 
of hyper- redundancy. 

In this paper, a local sensor based planning strategy 
for hyper- redundant manipulators is extended so it 
can be more accommodating. The local sensor bcised 
planner in [TCB] did not consider its own limitations. 
There, the hyper- redundant manipulator can only lo- 
cally adapt to a changing environment over a fixed 
length of the robot. For a fixed length of robot, a 
hyper-redundant manipulator uses its extensibility so 
it can locally avoid obstacles. However, the robot can 
only deform until its joint limits are met, in which 
case the hyper-redundant manipulator can no longer 
adapt. In other words, the robot used up all of its ex- 
tensibility over the fixed length. The longer this fixed 
length, the more the robot can deform. However, the 
longer the length, the less local the response, which 
is undesireable. In the new algorithm, the length of 
the deforming part of the robot is variable, there- 
fore enhancing its ability to locally adapt. In effect, 
the manipulator uses more of its extensibility from 
other parts of the robot to locally avoid objects. Ex- 
periments demonstrate that local sensor bcised plan- 
ning is not only useful, but also implementable in 
real time with very recisonable computing power and 
simple sensors. 

The structure of this paper is as follows. Section 2 
reviews the basic framework of hyper-redundant ma- 
nipulator kinematics which is based on “backbone 
curves.” The backbone curve and its deformation 
IS the basis for the algorithms of Section 3. We pri- 
marily consider algorithms for planar mechanisms, as 
the experimental verification of these ideas was per- 
formed on a planar robot. Many of these algorithms 
can be extended to the spatial case. Section 4 de- 
scribes the experimental setup, while Section 5 de- 
scribed the actual results of these experiments. 


2. Background 

This section reviews a hyper- redundant robot kine- 
matic analysis framework that forms the basis of this 
work. Recall from [ChB90b] that we assume that 
(regardless of mechanical implementation) the impor- 
tant macroscopic features of a hyper-redundant robot 
can be captured by a backbone curve. A backbone 
curve parametrization and an associated set of ref- 
erence frames which evolve along the curve are col- 
lectively called the backbone reference set In this 
paradigm, inverse kinematics and task planning re- 
duces to the determination of the proper time vary- 
ing behavior of the backbone reference set [ChB90bj. 


Similarly, local sensor based planning is equivalent in 
this approach to modification of the backbone curve 
shape in order to accommodate impinging obstacles. 

In [ChB92c], many techniques are introduced for 
parametrizing the backbone curve. In this paper, we 
will assume that the Cartesian position of points on 
a backbone curve can be parametrized in the form: 


c(s,t) = / 
Jo 


l(cr, t) ii(cr, t)da 


( 2 . 1 ) 


where s G [0, 1] is a parameter measuring distance 
along the backbone curve at time t. The backbone 
curve base is the point s = 0. x(s, t) is a vector from 
the backbone curve base to point s. By convention, 
x(0, t) = 0. k(s, t) is the unit tangent vector to the 
curve at $. l{s,t) is the length of the curve tangent 
and assumes the general form: 

l{s,t) = H-€(s,t) > 0. (2.2) 

e($,t) is the local extensibility of the manipulator, 
which expresses how the backbone curve locally ex- 
pands or contracts relative to a fixed reference state. 
We show later on that the robot needs this extensi- 
bility in order to locally avoid obstacles. 

The parametrization of Eq. (2.1) has the following 
interpretation. The backbone curve is “grown” from 
the base by propagating the curve forward along the 
tangent vector, which is varying its direction accord- 
ing to ti(s, t) and varying its magnitude (or ‘growth- 
rate^) according to l{s,t). 

Our experiments have been performed on a device 
with planar geometry. In the planar Ccise, the back- 
bone curve is the locus of points: 


where 


x{s,t) = [a;i(s,f), X2(s,t)]^ 

Xi(s,t)= / l{a^t)smO{a^t)da 
Jo 

X2{s,t) = / l{ayt)cos 6 {a^t)da. 
Jo 


(2.3) 

(2.4) 


0{s^t) is the angle, measured clockwise, which the 
tangent to the curve at s makes with the X 2 -axis 
at time t. By convention (2.4), ^( 0 ) = 0 , and 
xi(0) = X2(0) = 0. By comparing equations (2.1) 
with equations (2.3) and (2.4), it easy to see that 
u(s, t) = [sin^(s,t), cos0(s,t)j^ in the planar case. 
l{s) and 0(s) are termed “shape functions,” as they 
control they shape of the backbone curve through the 
forward kinematic relations (2.3) and (2.4). 


Within the context of this modeling technique, the in- 
verse kinematic problem, or “hyper-redundancy res- 
olution” problem, reduces to the determination of 
the time varying behavior of backbone curve shape 
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functions that satisfies task requirements. Different 
hyper-redundancy resolution techniques can be found 
in [ChB90a, ChB91a, ChB92a, ChB92c, Ch]. In one 
approach, which is relevant to the algorithm of Sec- 
tion 3, the backbone curve shape functions are re- 
stricted to a “modal form” 

Ne Ni 

t=l 

(2.5) 

where $t(5) is a “mode function,” and at(f) is the as- 
sociated “modal participation factor.” N = N$ Ni 
is the total number of modes, which must equal 
or exceeds the number of task constraints. Hyper- 
redundancy is resolved in the modal approach by con- 
straining the backbone curve to N effective DOF. The 
{$i} are predetermined functions chosen by the pro- 
grammer, and can often be selected to incorporate 
physical characteristics of the task [Ch]. Thus, the 
backbone curve geometry becomes solely a function 
of the {oi}. The inverse kinematics problem reduces 
to finding the {ui} which satisfy task constraints. In 
[ChB91a, ChB92c], closed form solutions are given 
for several choices of mode functions. 

A continuous backbone curve inverse kinematic solu- 
tion is used to determine the actuator displacements 
of a continuous morphology robot such as one con- 
structed from pneumatic actuator bundles. For dis- 
cretely segmented morphologies, such as the proto- 
type described in this paper in Section 4, the contin- 
uous curve solution can be used, via a “fitting” pro- 
cess, to compute the actuator displacements which 
cause the manipulator to exactly assume or closely 
approximate the continuous backbone curve model. 
The fitting techniques which are used in subsequent 
examples are reviewed in [ChB91a, ChB92cj. 


3. Local Sensor Based Planning Algo- 
rithm 

Local Sensor-Based Planning (LSBP) assumes that 
a backbone curve is somehow determined by a high 
level global planning process. The backbone curve 
shape is then modified in response to sensory infor- 
mation. LSBP does not use a model of the environ- 
ment, and is intended for rapid response to environ- 
ment changes. In order to describe the local sensor 
based planning strategy, a sensor model must be de- 
scribed. 

3.1. Sensor Models 

The algorithm described below uses a very simple sen- 
sor model. We assume that the sensors are rigidly at- 
tached to the backbone curve at a fixed point. That 
is, they move with the backbone curve, and their ori- 
entation is a function of the backbone curve tangent 
at the point of attachment. The sensors are assumed 
to measure, along a fixed direction termed the sen- 
sor measurement axis^ the distance to a nearby ob- 
stacle. The sensor measurement axis is a function 
of the sensor and the backbone curve geometry (See 


Fig. 1). Our sensors do not measure the distance 
to the point on the obstacle which is nearest to the 
backbone curve. Rather, they measure the distance 
which would actually be computed by realistic sen- 
sors. This simple model is representative of the in- 
frared and ultrasonic sensors discussed in Section 4. 
In addition, there is often some directional ambiguity 
due to the finite width of a typical sensor’s beam pat- 
tern. We assume that the sensor measurement axis 
is the centerline of the beam pattern. The distance 
measurement returned by the sensor is the nearest 
point of the obstacle lying within beam pattern cone. 
Since it is impossible to resolve the angular ambigu- 
ity, we assume that nearest point of the obstacle lies 
along the beam pattern centerline. 



Figure 1; Simplified Distance Mea- 
surement Sensor Model 


3.2. Partial Shape Modification Con- 
trol 

This section describes a PSM planning strategy in 
which the backbone curve is approximated by a large 
number, n^, of discrete endpoints. The sensors are 
assumed to be rigidly attached at points along the 
discretized backbone curve. There are typically many 
approximating segments between adjacent sensor at- 
tachments. When a sensor detects the presence of an 
obstacle, the backbone curve shape locally deforms 
in a region around the sensor. In our simulations 
and experiments, Ud 100, and there were about 10 
discrete points between sensor points. 

The actual response, a displacement of the approx- 
imating points, is determined by a local sensor re-- 
s'ponse function (LSRF), which is assumed to be a dis- 
crete unimodal function.^ A unimodal function is one 
which has one local maximum, the global maximum, 
over its domain. The response function is “added” to 
the current backbone curve, locally drawing it away 
from an obstacle. In Figure 2, a triangle LSRF is 
added to a straight backbone curve, deforming the 
backbone curve away from an sufficiently close ob- 
stacle. 

Since the robot only detects the obstacle at a sensor 
point, the reaction to the obstacle should be greatest 
at the sensor point, and should monotomically de- 
crease at points away from the sensor point. There- 
fore, the sensor point is the center of this unimodal 
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function, and is assumed to be farthest away from the 
obstacle. 

The LSRF should also look like the beam pattern of 
the sensor associated with the sensor point. Typi- 
cally, beam patterns have a central lobe along the 
sensor axis, in which the obstacle likely lies. In the ex- 
perimental setup, the spatial resolution of the robot’s 
actuators is much lower than the azimuth resolution 
of the sensors on the robot. Therefore, a simple tri- 
angle (or cone) is a sufficient approximation to the 
main lobe of the beam pattern, and thus, a recison- 
able choice for a LRSF. Later on, it is shown that a 
triangle response function leads to a trivial and ef- 
ficient solution to LSBP for planar hyper-redundant 
manipulators. So, the example displayed in Figure 2 
is a good example of a LSRF. 

The half width of the LSRF is slightly larger than the 
distance between two adjacent sensors on the back- 
bone curve. This way, if two adjacent sensors detect 
the same obstacle, their cumulative response function 
is still unimodal. 



Figure 2: Backbone, 

Response Function, and 
Deformed Backbone 


In this approximation method, the position of the dis- 
crete segment endpoints can be approximated by the 
discretization of the continuous forward kinematics 
integral (Eq. (2.1)): 

k=i 

p{si,t) = ^ l{sk,t)u{sk,t) (3.2.1) 

k=0 

/(s) and T?(s) are continuous shape functions which 
are specified by a global planner. They need not as- 
sume a modal form. Also, an endpoint may or may 
not coincide with a sensor point. 

A small differential change in p{si) is: 

k—i 

Sp{si,t) = ^2 Sl{sk,t)u{sk,i) l{sk,t)5u{sk,t) 
k:=0 

(3.2.2) 


where Sk = where Ud is the number of discrete 
points along the back bone curve. 5u is a local change 
in the backbone curve tangent direction, while 51 rep- 
resents a local stretch. 

The goal of this PSM method is to compute the local 

perturbations, Su and 51, which deform the backbone 
curve away from obstacles. The changes in backbone 
curve tangent and stretch are determined from 
which in turn is determined by the LSRFs. The mod- 
ified backbone curve shape is then used by the fitting 
algorithms to determine the appropriate actuator dis- 
placements. 

Assume that at some initial time, a global planner 
specifies a backbone curve shape. Thus, Sp = 0 ini- 
tially. For each sensor point along the backbone curve 
that detects an obstacle within its response envelope, 
a discrete unimodal LSRF is added to (or subtracted 
from, depending upon from which direction an ob- 
stacle appears to be) the vector which contains 
the prescribed changes to the backbone curve. Set- 
ting 5p{sn,t) = 0, guarantees that, within the lim- 
its of the discretization approximation, the end effec- 
tor position will not change. Setting Jp(sn,t) = 0, 
5p{sn-iyt) = 0, and (5il(5„,t) = 6 guarantees that 
the end effector position and orientation will not 
change. The new backbone curve can be computed 

after Su and 51 are determined from (3.2.1). 


In the case of a planar backbone curve, (3.2.2) can 
be written in matrix form: 



( 3 . 2 . 3 ) 

where 5p^ = 5p(sj,t), 5it^ = 5u{siyt), 5l{ = 5/(si,t), 
fj = f( 5 i), and iT = u{si^t). The x and y compo- 
nents of 5p{si^t) are respectively denoted and 
5py. Similary, the x,y components of 5u{si,t) are 
and 5uy, 5p and 5u each have 2n elements, and 
51 has n elements. 

For given there is not a unique solution to 
Eq. (3.2.3). A simplified (and unique) solution for 
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(3.2.3) is obtained by setting l{si,t) = 1 Vi<i<n (i.e 
Sl{siyt) = 0). Although all of the = 1, the 

snake can still use its extensibility to avoid obstacles 
because ||iT ^ 1. In other words, the length of 

the tangent vectors are no longer constrained to have 
unit length in this approximation unless additional 
restrictions are employed. After setting Sl{si^t) = 0 

and enforcing the end effector constraints, (3. 2.3) be- 
comes: 
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( 3 . 2 . 4 ) 

which has a simple, obvious and easily computed so- 
lution to (3.2.4). 


< = pi-pi~^ (3.2.5) 

wj, = pt - pt"' (3.2.6) 

The discretized backbone curve is then used, via a 
fitting procedure, to compute the local actuator dis- 
placements which implement the desired deforma- 
tion. Figure 3 displays a PSM deformation of a 30 
DOF variable geometry truss manipulator (kinemati- 
cally identical to the real system described in Section 
4) in response to an impinging obstacle. The back- 
bone curve is approximated by 100 segments. The 
manipulator is originally in a straight configuration, 
which locally deforms to avoid a simulated obstacle 
in Figure 4. In this simulation, the response function 
is shaped like a triangle. 



Figure 3: OriginaJ| Figure 4: Resultingl 


3.3. Extended PSM 

Due to mechanical limitations of the robot, such cis 
joint limits, a real hyper-redundant manipulator has 
a limited amount of local extensibility. That is, ||£|| < 
T where T is the limit of local extensibility of the 
manipulator. This means that at any point, there is 
a fixed range over which the backbone can expand or 
contract relative to a fixed reference state. 


The original PSM cissumes that there is infinite local 
extensibility, which is unrealistic for actual robots. In 
the example in figure 5, an object becomes unaccept- 
ably close to the robot, which locally moves away 
from the object using an LSRF. However, the ob- 
ject continues to move towards the already deformed 
robot, which wants to move further away locally from 
the object, using the same LSRF. Although a back- 
bone curve is determined in this situation, it is not 
likely that a real mechanism can fit this curve because 
this backbone requires a lot of local extensibility from 
the manipulator. In this case, the ||6|| < T constraint 
was violated because ||5u(si,t)|| > T. 



Figure 5: Same Response 


So, a new LSRF has to be used which utilizes the 
extensibility of neighboring regions on the backbone 
curve, while not using any local extensibility from the 
already deformed section (i.e. maintaining the con- 
straint ||e|| < T). This is called “sucking” extensibil- 
ity from other parts of the robot snake. In figure 6, 
the local extensibility limit is not exceeded, and the 
robot is still able to accommodate for the object’s 
displacement. 



Figure 6: Modified Response 
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4. Experimental Setup 

To prove the feasibility of the proposed algorithms, 
a distributed sensor system was developed for the 30 
degree-of-freedom hyper-redundant robot system de- 
scribed in [ChB92b]. Figure 7 shows the structure 
of this testbed. This section describes the test bed 
structure in detail. 


Sparc staiion2 1 VME CAGE 



30 DOF Hyper Redundant M«nipuJ«ior ScMorlnlcrlice 


4.1. Hyper redundant manipulator and 
control system 

The hyper-redundant manipulator is a modular Vari- 
able Geometry Truss design [Ch]. The 30 degree-of- 
freedom (DOF) planar robot consists of ten modules 
(also called bays) of 3 DOF each (Fig. 7). Each DOF 
consists of a D.C. servo mptor which drives a lead 
screw. Each lead screw is instrumented with a lin- 
ear potentiometer. The real time system controller is 
based on a VME-bus multiprocessing computer, cur- 
rently consisting of two Heurikon (68030 and 68020) 
single board processors, and the VxWorks real time 
operating system. One processor is dedicated to the 
closed loop feedback control of the actuator positions. 
The other processor is dedicated to processing of sen- 
sor data and real time computation of the PSM algo- 
rithms. 

To enable flexible, modular, and easily expandable 
experimentation with sensor bcised planning, a novel 
34 wire “Sensor Bus” architecture wcls developed for 
the sensor system. One end of the sensor bus is con- 
nected to the PSM processor via a parallel port. The 
sensor bus consists of an eight bit outgoing data path, 
a four bit status line, a two bit strobe and one inter- 
rupt request line. The data path and the two strobe 
lines enable the CPU to access up to 256 sensors and 
to send eight bits of information to the sensor pe- 
ripherals for possible sensor control purposes. The 
interrupt request line is connected to the hardware 
counter on the CPU board so that accurate timing 
measurements can be made in real time. 

Sensors can be added to the system via “Sensor In- 
terface Modules.” This module decodes the sensor 
bus address and generates signals to control sensors. 
Up to two ultrasonic sensor modules and six sets of 
sensors which produce data with 4 bit (or less) quan- 
tization can be controlled. Currently, only four in- 
frared sensors per board are present, though up to 
eight infrared sensors and eight mechanical switchs 
can be directly connected. The sensor interface mod- 
ule circuitry is mounted on a printed circuit board 
which is 15cm by 12cm in size. Fig. 8 shows a photo- 
graph of the sensor interface module. The sensor bus 
is physically connected in the bottom of the module 
in a daisy-chain fashion. In the figure, two ultrasonic 
transducers are shown above the module. Also the 
infrared proximity sensor is shown on the side of the 
module. 

4.2. Sensors 

Currently, the robot has two types of sensors: in- 
frared (IR) and ultrasonic (US). There are five sets of 
two US sensors. Each set is rigidly attached to alter- 


Figure 7: Hyper Redundant Robot 
Test Bed 



Figure 8: Sensor Interface Module 


nate bays so that each sensor points outward from the 
backbone curve (or the centerline of the mechanism). 
An additional US sensor will be mounted at the front 
of the mechanism, pointing forward. Twenty IR sen- 
sors, again two at each the ten bays facing outward, 
are currently mounted on the snake. In the near fu- 
ture, twenty-four additional IR sensors will be added, 
two more at each bay, and four in front. Fig. 9 
schematically shows how these sensors are distributed 
throughout the mechanism. 


lofrired \ cone 




On* Bay 


L'ltruoaic Sensor 


Figure 9: Sensor Arrangement 
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Electrostatic type ultrasonic sound transducers deter- 
mine distance by measuring the time of flight of the 
ultrasound pulse leaving the transducer, bouncing off 
an object and returning to the sensor. A 50kHz sonar 
wave burst is transmitted when the sonar-ranging 
module is triggered [Ci]. The ranging module out- 
put is connected to the sensor bus interrupt request 
line. The echo return time is computed by the CPU 
hardware counter. Since the sixteen bit resolution 
distance measurement result is read from the hard- 
ware counter, no result is ever sent through the sensor 
bus. This design greatly simplifles the hardware re- 
quired. 

The US sensors are activated sequentially (at 16 mil- 
lisecond intervals) to prevent interference between 
sensors. These sensors are calibrated to measure dis- 
tances ranging from 10cm to 2.5m, with a 2% accu- 
racy. There is about twenty degrees of conical ambi- 
guity for direction, because of the transmitting beam 
pattern of the transducer [Ci]. In this work, it is 
assumed that the obstacle lies along the cone’s cen- 
terline, which is locally normal to the backbone curve 
at the point of sensor attachment. 

The IR sensors yield binary proximity information- 
i.e., the presence or absence of the obstacle in some 
pre-set range. An infrared LED emits modulated in- 
frared light, and if an obstacle is near the robot, the 
IR sensor w^ detect the reflected light. The range of 
the IR system can be adjusted by setting potentiome- 
ters on the sensor boards. Currently, the IR system 
is set up to detect the presence of obstacles up to four 
inches away from the robot. Like the US, the loca- 
tion of an obstacle is not precisely known, but lies 
somewhere in a cone emanating from the IR sensor. 

Each sensor has its own advantage. The IR sensors 
have a very fast response and can be sampled at ex- 
tremely high rates. They are thus suitable for the 
PSM system. The US sensors provide proportional 
obstacle distance, rather than binary proximity in- 
formation. They are thus more useful for accurate 
planning. However, since the US sensors are sequen- 
tially polled to prevent interference, the minimum 
sampling period is 176 milliseconds. The IR sensors 
are sequentially polled in a similar fashion, but at 
a significantly higher rate. To maximize the use of 
both types of sensors, the sensor interface module is 
designed to operate both US and IR sensors simulta- 
neously in different intervals. 

4.3. Remote Operation Console 

The real time computers are connected to Sun work- 
stations via the ethernet. Via software sockets, infor- 
mation can be transferred through the ethernet be- 
tween the real time computer running VxWorks and 
the Sun workstations running Unix. C programs and 
many software packages, such as Matlab, are able to 
directly communicate with the real time computers 
via the sockets. Therefore, these programs can con- 
trol the snake. The FSM and higher levels of control 
are implemented on the SUN workstation. 

Experimental robot control programs are developed 



Figure 10: Infrared Sensor 


in a combination of C and Matlab. Via an X- Window 
interface, these programs graphically display and con- 
tinually update the robot’s configuration and sensor 
measurements. Fig. 11 shows the X- Window opera- 
tion console window. In addition, many motion plan- 
ning and sensing commands can be executed using a 
graphical menu interface. End-effector via points of 
a hyper-redundant trajectory can be specified by a 
mouse, and the trajectory is then executed by the 
real-time system. 

In addition to graphically depicting the current con- 
figuration of the manipulator, this system displays 
US and. IR sensor measurements. The solid cones 
emanating from the manipulator represent US sen- 
sor data. In this representation scheme, the closet 
point to an obstacle in the sensor beam pattern lies 
somewhere on the distal arc of the cone. The dashed 
arcs much closer to the mechanism indicate that the 
IR sensors have detected nearby obstacles at these 
locations. 



Figure 11: Operation Console 
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5. Results 



The PSM algorithms described in Section 3.2 and 
3.3 have been implemented on our hyper-redundant 
robot test-bed. Photographs of two experiments are 
shown below. In the first experiment, the backbone 
curve, dictated by some high level planner, was a 
straight line. Two obstacles were moved into an un- 
acceptably close proximity (about 10cm or 4in) to the 
mechanism, and the manipulator locally deformed 
away from each obstacle while maintaining constant 
end-effector position. Truthfully, the end-effector was 
displaced slightly from its original position (less than 
a 1 inch displacement over a distance of '^^16 feet). 
The current implementation of the discrete approx- 
imation algorithm employs only IR sensors, because 
there are many more IR sensor distributed along the 
snake. See figure 12. 


Figure 12: First Experiment 


In the next experiment, again, the backbone curves 
starts off as a straight line. See figure 13. One obsta- 
cle was moved unacceptably close to the robot which 
resulted in the mechanism moving, as it did in the 
first experiment. See figure 14. Then, the object was 
moved sufficiently close to the deformed robot, pass- 
ing through the original backbone curve, and the ma- 
nipulator still deformed away. See figure 15. Such a 
large local deformation would not have been possible 
with the original PSM. 

Th second experiment showed the local shape modifi- 
cation capability of the new PSM algorithm proposed 
in this paper. In real time, the old PSM reliably 
works, when the actuators in the section of the robot 
that is deforming are not near their joint limits. In 
such a case, the new PSM is the same as the old PSM. 
As actuators’ limits are approached, local deforma- 
tion may become infeasible, and this is where the new 
PSM becomes useful. The new planner uses extensi- 
bility from neighboring actuator displacements along 
the manipulator so, the robot can still locally deform. 
However, once all the actuators reach there limit, i.e. 
all the extensibility is used up, the robot has to report 
to a higher level planner in order to accommodate for 
all the constraints in the environment. This ability is 
currently being implemented in our system. 



Figure 13: Second Experiment 
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Figure 15: Final 


6. Conclusion 

In this paper, a local sensor based planning method 
for hyper^redundant robots is extended. This method 
is based on a backbone curve kinematic framework. 
In the previous work, the limit of local deformation 
was limited to the extensibility over a fixed portion 
of the robot. In this paper, in order to better com- 
pensate for objects penetrating the backbone curve, 
extensibility was used over a variable portion of the 
robot. 

This method was implemented on an actual 30 DOF 
hyper-redundant manipulator test bed. An innova- 
tive sensor bus architecture and a graphical program- 
ming and display interface were reviewed. Experi- 
ments using this system showed the applicability and 
effectiveness of the proposed method to real hyper- 
redundant manipulators. A reasonable amount of 
computer power was required for real-time implemen- 
tation of these algorithms. 

As suggested in the previous section, we are currently 
working to improve the communication between low 
level planners and a high level planner so that the 
robot can better interpret and react to exceptional 
conditions indicated by the PSM level. In addition, 
we intend to develop better sensor function meth- 
ods which properly combine ultrasonic and infrared 
sensor readings from adjacent sensors. The highly 
distributed nature of sensors on a hyper-redundant 
mechanism also point to the need for new theories 
on deploying and using massively redundant sensor 
arrays. Finally, future work will focus on using sen- 
sor data for higher level, i.e. global, hyper-redundant 
robotic planning. 
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Abstract 

This paper investigates the fault-tolerant control of 
hyper-redundant spatial manipulators. The standard re- 
solved rate control law using the pseudoinverse is modi- 
fied to account for joint failures. To combat the problem 
of extremely high joint velocity solutions generated near 
singular configurations by the pseudoinverse, the singu- 
larity robust inverse is employed. A method to com- 
pute an optimal scale factor for the robust inverse is 
derived. Simulation results of this approach applied to 
an 11 DOF manipulator are presented which verify the 
validity of this approach. 

1 Introduction 

Recent advances in the field of robotics has resulted 
in redundant manipulators gaining increased attention 
due to advantages over conventional manipulators such 
as increeised dexterity. Hyper-redundant manipula- 
tors represent the next step in manipulator evolution. 
These manipulators posses a large number of Degrees- 
Of- Freedom (DOF). This paper investigates the kine- 
matic control of hyper-redundant spatial manipulators 
with particular emphasis on fault-tolerant control. 

The redundant structure of such manipulators en- 
dows them with many desirable properties, such as 
fault- tolerant features. The redundancy can be ex- 
ploited to seamlessly complete end-effector tasks in 
the face of single or multiple joint motor failures. A 
rate-inverse kinematic control algorithm utilizing the 
pseudoinverse is presented that is insensitive to arbi- 
trary joint motor failures. Another feature of redun- 
dant manipulators is the possibility of optimal joint- 
motion coordination. However, redundant manipula- 
tors are plagued by the presence of singular configura- 
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tions. In these configurations first order motion in a 
certain end-effector direction is not possible, and are 
associated with loss of manipulator Jacobian matrix 
rank. Standard kinematic control algorithms fail at or 
near singular configurations. The singularity problem 
is also considered by implementing a singularity robust 
kinematic control algorithm that is insensitive to joint 
failures. The robustness is determined by a scaling fac- 
tor. An optimal method to pick the scaling factor is 
also derived such that end-effector tracking accuracy is 
sacrificed in order to not violate joint velocity limits. 
Simulation results of this approach applied to an 11 
DOF spatial manipulators are presented which verify 
the validity of the proposed methodology. 

2 Manipulator Kinematics 

In this section, a brief review of manipulator spatial 
kinematics is presented. For an n-link, serial, open- 
loop spatial manipulator, denote the space of joint co- 
ordinates by q E Q = T”. The corresponding end- 
effector position and attitude is denoted by x G A' = 

X 50(3). For a redundant manipulator, n > 6, The 
end-effector motion kinematics are given by: 

i = [ p 

Here, p denotes the end-effector position, oj is the end- 
effector angular velocity expressed in an inertial Carte- 
sian reference frame, and J{q) is the 6x n “constructed” 
Jacobian transformation matrix. 

All manipulators posses singular configurations in- 
side their workspace. These configurations, q^ , are 
manifest when the Jacobian matrix loses full rank, 
i.e. rank{J) < dim{A!), In terms of the Singular 
Value Decomposition (SVD) of the Jacobian matrix, 
J = this corresponds to at least one singular 

value CTi = 0. The corresponding singular direction Ui 


= m q 
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(the i-th column of U) is the direction in which end- 
effector motion to first order is instantaneously impos- 
sible. At a singular configuration, a solution for the 
joint rates cannot be constructed from the first order 
approximation of the forward kinematics. Hence, kine- 
matic control algorithms that rely on inversion of the 
Jacobian matrix in one form or the other generate ex- 
tremely large joint rate solutions near or at a singular 
configuration. 


4 Resolved Rate Law For Failed Joints 

The solution to the quadratic optimization problem 
in the previous section assumed that all the joints were 
active or had not failed. In this section a modified ver- 
sion of the pseudoinverse which accounts for joint fail- 
ures is derived. Assuming joint i fails, this implies that 
qi = 0, The incidence of a failed joint can be expressed 
as an additional constraint of the form: 


3 Inverse Rate Kinematics 

Manipulator tasks are described in end-effector 
space, A', while actuator commands are in joint-space, 
Q. For this reason, an inverse kinematic algorithm is re- 
quired to generate joint-angle commands from the end- 
effector commands. The inversion can be carried out 
at different differential levels of the forward kinematics. 
In this paper only first order inversion or Inverse Rate 
Kinematics (IRK) will be considered. Kinematic con- 
trol algorithms that use this approach are also known 
as Resolved Rate Control (RRC) algorithms. 

For inverse rate kinematics, the end-effector velocity 
profile along the desired trajectory is specified. At each 
increment, the joint rates are computed by “inverting” 
the Jacobian matrix evaluated at the current configu- 
ration. These joint rates are then integrated (usually 
via Euler integration) to generate the next set of joint 
angles. Assuming the current configuration is nonsin- 
gular, the general form of the solution for joint rates 
is, 

4k = J^Qk) -f 


— Omxl 


The m X n constraint matrix. A, is a zero matrix with 
A(i, j) = 1 for the j-th failed joint with a total number 
of failures equal to m. To illustrate this, for a 9- link ma- 
nipulator with joints 4 and 7 failed, the corresponding 
A matrix has the form: 


0 0 0 1 0 0 0 0 0 
000000100 


To derive the modified pseudoinverse to account for 
failed joints, the quadratic optimization problem (2) is 
reformulated as: 


min 0.5 q^q (2) 


subject to J{q)q 
subject to Aq 


X 

Omx 1 


The solution of this new optimization problem (2) is, 
q = B J'^ [J B X (3) 


where the notation denotes a generalized inverse and 
V is the instantaneous self motion (or null motion), i.e. 
J{qk) = Omxi- For nonredundant manipulators — 
and V = 0„xi- 

For a redundant manipulator [j J^] 

known as the pseudoinverse of J, and i/ ^ Onxi- The 
pseudoinverse originates from a 2-norm joint rate min- 
imization problem. Specifically it is obtained as the 
solution to the following quadratic cost optimization 
problem: 


min 0.5 q^q (1) 

subject to J(q)q = x 

In the redundant case, there are n— 6 degrees of freedom 
in the nullspace of the Jacobian matrix that can be 
assigned arbitrarily. In both cases, the joint angles are 
then obtained from: 


qk+i = qk + 4k At 


where: 


B = 


Inxn 


A^{A A 


Comparing this solution with the one for all joints free, 
it is seen that by setting A = Omxn the pseudoinverse 
solution is obtained. 


5 Robust RRC For Failed Joints 

The standard IRK solution utilizing the pseudoin- 
verse fails at or near singular configurations due to ex- 
tremely large joint rate solutions. It has been shown 
[2] that the pseudoinverse does not generate singular- 
ity free trajectories. Therefore, the pseudoinverse ap- 
proach does not posses any singularity avoidance prop- 
erties. Since solving (2) can drive the system to a singu- 
lar configuration, one approach to alleviate this prob- 
lem is to relax the equality trajectory constraint. 

The Singularity Robust Inverse (SRI) (or damped 
least squares) proposed in [6], and [8] as an alternative 
to the pseudoinverse accomplishes a tradeoff between 
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accuracy and feasibility of solution. Near a singular 
configuration the joint rates remain finite and bounded 
in exchange for a build-up in tracking error. The robust 
resolved rate law is obtained from the solution to the 
constrained minimization problem, 

min O.be^We (4) 


where: 


e 

W 


X - J{q)q ' 
Q 

W, Oexn 

0nx6 


The weighting matrices, W\ (6 x 6) and W 2 (n x n), are 
arbitrary. Assuming W\ = lex 6 and W 2 = « Inxn, the 
solution to (4) is given by [6]: 

g = J*X= [ + «/]■' (5) 


In the above, J* is the Singularity Robust Inverse 
(SRI). An alternative but equivalent expression for J* 
is [6]: 

q = J*x = [ J + kI j ^ X (6) 

Setting the scaling factor /c = 0 in (11), it is seen that 
J* reduces to the standard pseudoinverse jf. The er- 
ror vector e represents the tradeoff between accuracy 
of solution (expressed by e(l)) and feasibility of solu- 
tion (expressed by e(2)). The tradeoff parameter is the 
scaling factor, /c, which is the degree of freedom in the 
formulation of the SRI control law. In the following 
section, an optimal method to chose the scaling factor 
is presented. 

The standard formulation of the SRI assumes that 
all joints are free. To derive the modified SRI to ac- 
count for failed joints, the optimization problem (4) is 
reformulated as: 


min O.be^We (7) 


subject to Aq = Ornxi 

The solution to this new optimization problem (7) is 
given by, 


q = p-^ 
where: 


I -A^ [AP-^A^] ^AP-^ 


j'^Wi X 

( 8 ) 


P = j'^Wi J + W 2 


It will be assumed that W\ = lex 6 ^nd W 2 = k Inxn- 
It is noted that the first term in the right hand side of 
(8) is just the standard SRI solution, while the second 
term accounts for the effect of the failed joints. 


5.1 Determining the Scaling Factor 


The robustness properties or the SR-Inverse are de- 
termined by the scaling factor /c, which expresses the 
tradeoff between the exactness and feasibility of solu- 
tion. Depending on the particular emphasis of an ap- 
plication, several methods for choosing the scaling fac- 
tor have appeared in the literature [6], [8], [4]. These 
methods adjust the scaling factor as a function of some 
Jacobian based metric such as the minimum singular 
value or the manipulability measure. Hence, the scal- 
ing factor is adjusted based on the proximity of the 
manipulator to a singular configuration. 

Another approach is to chose the scaling factor such 
that an appropriate norm of the joint rates does not 
exceed a predetermined threshold [3], [5], Such an ap- 
proach directly takes into account the maximum allow- 
able joint velocity constraints while minimizing the end- 
effector deviation from the desired trajectory. When- 
ever the pseudoinverse (or modified pseudoinverse in 
the case of failed joints) does not violate the joint ve- 
locity threshold (i.e. the solution is feasible) scaling is 
not required and k — 0. In this case the SRI solution 
is not required. When the pseudoinverse solution vio- 
lates the velocity norm threshold (i.e. the solution is 
infeasible) the SRI can be used to generate a feasible 
solution which minimizes the deviation from the speci- 
fied end-effector trajectory if the choice of scaling factor 
satisfies: 


\\q\\m = \\ j'^ [j J'^ + K l] ‘ i l|m < qmax (9) 


In (9), the notation || • \\m denotes the vector m-norm. 
Hence, the problem of generating joint rates that do 
not violate a rate limit is posed as follows: 

if II J^x II^Tj < qmax « = 0 else solve 

II = W (10) 


Obtaining a closed form solution to (10) is not practi- 
cally possible as it is a nonlinear problem. An iterative 
approach to solve (10) for m = 2 has been presented 
in [3] and [5]. However, an approximate closed form 
solution can be obtained which is described in the fol- 
lowing. 

Using the singular value decomposition (SVD) of the 
Jacobian matrix, J = (10) can be written in 

the form, 


II q ||„ = II V St t/^ i lU = W (11) 
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where: 




0 

0 


(Tj + K 


0 

0-2 

n 


<T2 + K 


Et = 

0 

0 

0 

0 

<7-6 

<r'i + K 

0 






To simplify (11), the joint velocity norm can be upper- 
bounded using induced matrix norms [7]: 

II 9 lU < II V 11,^ II St lU II i llm (12) 

In (12), the notation || • \\im denotes the induced (m) 
norm corresponding to the vector norm || • Wm- Using 
(12), a conservative relation involving the joint velocity 
norm threshold is: 

II Wim II St lU II u'^ X lU < 9max (13) 


norm. Assuming that maximum absolute velocity limit 
is the same for all joints, (14) reduces to: 


max —5 < 

* (Ti 


Qmax 

IIV^U ||t/^x||oo 


(15) 


For a fixed value of the scaling factor «, as a singular 
value (Ti approaches zero, the expression (Ti/{(Tf + k) 
increases until it reaches a maximum value when <Xi — 
y/n and then decreases to zero. Therefore, 


max — s < 

* (Ti + K 



Hence, a conservative solution for the scaling factor that 
will satisfy (15) is: 


K > 


V 


X 


1 2 


2 ^ma 


(16) 


In actual implementation, the equality is used in com- 
puting the scaling factor. Similarly, if the joint velocity 
threshold is expressed in terms of a 2-norm, the solution 
for the scaling factor is given by: 


Solving for from (13): 


II Him < 


Qma 


U '^ X 


(14) 


The relation (14) is the key to solving for the scaling 
factor in order to enforce the joint velocity norm lim- 
its. Typically, joint velocity thresholds are specified as 
either a 2-norm or oo-norm constraint. For a 2-norm 
rate limit, the induced 2-norm of is [7], 

II si ||i2 = 

where: 

^max ^ E^^E^ ^ = Maximum eigenvalue of E^^E^ 

For an oo-norm rate limit, the induced oo-norm of E^ 
is [7]: 

II si II, 00 = 2 i^ljl 

j 

Due to the special structure of E^ and the fact that 
eJj > 0, it is eeisy to show that in both ceises its induced 
norm reduces to: 


II si lb = II si lU = 

t (T, K 

In applications, usually the maximum absolute joint 
velocity is limited instead of the quadratic velocity 


K > 


II 11.2 II U'^ i II 2 

2 Qmax 


2 


6 Numerical Simulations 

In this section, simulation results of the techniques 
described in this paper applied to an 11 DOF spatial 
manipulator are presented. This is a special modular 
manipulator developed at NAS A/ JSC [1] with a 15 foot 
reach. Its architecture is described by the joint se- 
quence RPR-PR-PR-PR-RPR where R denotes a roll 
joint and P denotes a pitch joint. 

The first simulation example is used to illus- 
trate the fault-tolerant resolved rate law using the 
modified pseudoinverse (3). The initial config- 
uration of the manipulator is given by qq = 
[45, -60, 0, -45, 0, 45, 90, 45, 0, 0, 0, 0]. The ini- 
tial configuration is shown in Figure 1. The end-effector 
command is x = [ -5.5, -2.5, 1.5, 0, 0, 0 ] with units 
of in/sec for the translational component and deg/sec 
for the rotational component. In the following figures, 
the end-effector position is displayed as, solid line for 
x-axis, dashed line for y-axis, and dotted line for z-axis. 
The end-effector attitude is given as a Pitch, Yaw, Roll 
(PYR) Euler angle sequence, with solid line for pitch, 
dashed line for yaw, dotted line for roll. For all joints 
free throughout the simulation, the results for the stan- 
dard pseudoinverse are shown in Figures 2 and 3. Fig- 
ure 2 shows the end-effector position and attitude while 
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Figure 3 shows the joint angle trajectory. Now consider 
the case when joints 1,6, and 7 have failed throughout 
the simulation. The results for the modified pseudoin- 
verse (3) are shown in Figures 4 and 5. Figure 4 shows 
the end-effector trajectory which is identical to the all 
joints free case. Figure 5 shows the joint angle trajec- 
tory. It is seen that joints 1,6, and 7 do not move as 
would be expected. 

To illustrate the SRI with joint velocity thresh- 
olds, consider a maneuver of moving the end-effector 
in the x-direction. The initial configuration is qo = 
[0, -50, 0, -30, 0, 70, 0, 30, 0, 0, 0]. The end- 
effector command is x = [-5, 0, 0, 0, 0, 0]. First 
consider the case when all joints are free throughout 
the simulation. The results of using the standard pseu- 
doinverse are shown in Figures 6 and 7. Figure 6 shows 
the position and attitude history of the end-effector, 
while Figure 7 shows the joint rate profile. It is seen 
that the maximum joint rate is less than 6 deg/sec. 

Now consider the case when joints 2,4, and 7 have 
failed throughout the simulation. The results of using 
the modified pseudoinverse, (3), are shown in Figures 8 
and 9. The end-effector trajectory shown in Figure 8 is 
seen to deviate somewhat from the desired trajectory. 
This is due to the very high joint rate solution (up to 
400 deg/sec) generated by (3) shown in Figure 9, and 
the particular integration approach used to generate 
the end-effector trajectory. The results of using the SRI 
with a maximum absolute joint rate limit of 50 deg/sec, 
II Q llioo £ 50 deg f sec, are shown in Figures 10 to 
12. The end-effector trajectory is shown in Figure 10 
where the deviation from the desired trajectory appears 
in the x and z-axes and pitch angle. The main effect 
of using the SRI is the reduction in system response in 
the commanded direction. From Figure 10 it is seen 
that at the end of the simulation the manipulator has 
only moved halfway in the x-direction. The joint angle 
trajectory is shown in Figure 11 from which it is seen 
that the joint rate limit is not exceeded. Finally, Figure 
12 shows the history of the SRI scale factor. It is seen 
that the SRI solution is activated at approximately 2 
seconds. 

7 Conclusion 

This paper has addressed the fault tolerant kine- 
matic control of hyper-redundant manipulators. The 
standard resolved rate control law utilizing the pseu- 
doinverse was modified to account for joint failures. 
However, pseudoinverse based control laws fail at or 
near singular configurations due to extremely high joint 
rate solutions. To generate feasible joint motions near 
singular configurations, the Singularity Robust Inverse 
(SRI) instead of the pseudoinverse was employed. The 


standard formulation of the SRI was modified so as to 
account for the possibility of failed joints. As the ro- 
bustness properties of the SRI are determined by the 
choice of scaling factor, an optimal method to pick the 
scaling factor was presented. Numerical simulations 
were presented utilizing an 11 DOF spatial manipulator 
to illustrate the proposed solution methodologies. The 
simulation results verified the validity of the proposed 
methods. 
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Figure 2: Pseudoinverse solution for all joints free: 
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Figure 5: Pseudoinverse solution for all joints free: 
Joint angles 
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Figure 3: Pseudoinverse solution for all joints free: 
Joint angles 


Figure 6: Pseudoinverse solution for all joints free 
End-effector position and attitude 
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Figure 7: Pseudoinverse solution for all joints free: 

Joint rates 
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Figure 8: Pseudoinverse solution for 3 failed joints: 
End-effector position and attitude 
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Figure 10: SR-In verse solution for 3 failed joints: 

End-effector position and attitude 
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Figure 11: SR-Inverse solution for 3 failed joints: Joint 
rates 
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Figure 9: Pseudoinverse solution for 3 failed joints: Figure 12: SR-Inverse solution for 3 failed joints: Scale 

Joint rates Factor 
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AhaiTaci— The high cost involved in the retrieval and 
repair of robotic manipulators used for remediating nu> 
clear waste, processing hazardous chemicals, or for explor- 
ing space or the deep sea, places a premium on the reli- 
ability of the system as a whole. For such applications, 
kinematically redundant manipulators are inherently more 
reliable since the additional degrees of freedom (DOF) may 
compensate for a failed joint. In this work, a redundant ma- 
nipulator is considered to be fault tolerant with respect to 
a given task if it is guaranteed to be capable of performing 
the task after any one of its joints has failed and is locked in 
place. A method is developed for insuring the failure toler- 
ance of kinematically redundant manipulators with respect 
to a given critical task. Techniques are developed for an- 
alyzing the manipulator’s workspace to find regions which 
are inherently suitable for critical tasks due to their rela- 
tively high level of failure tolerance. Then, constraints are 
imposed on the range of motion of the manipulator to guar- 
antee that a given task is completable regardless of which 
joint fails. 

I. Introduction 

Kinematically redundant manipulators have been 
proposed for use in the cleanup and remediation of nu- 
clear and hazardous materials, as well as for remote ap- 
plications such as deep space or sea exploration, where 
repair of broken actuators and sensors is impossible and 
the probability of their failure is increased due to the harsh 
operating environment [2], [3]. In these situations the ex- 
tra degrees of freedom of a redundant manipulator may be 
used to compensate for the failed joints if the manipulator 
has been properly designed and controlled. The most basic 
task of a manipulator, i.e. the positioning/orienting the 
end effector in the workspace, is described by the forward 
kinematic equation 

X = m, (1) 

where x G is the generalized vector of the posi- 
tion/orientation of the end effector and 0 £ is the 
vector of joint variables. In this framework, point to point 
tasks can be described by a series of end-effector positions 
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to be obtained at desired times, i.e., x(<j), with a kine- 
matic inverse equation 

e = r\x) ( 2 ) 

being solved to determine the corresponding required joint 
values, A kinematically redundant manipulator can, 
in general, satisfy an end effector positioning constraint, 
x(tj), with an infinite family of joint values satisfying (2). 
The underlying premise for advocating the use of redun- 
dant manipulators for critical applications is that if a joint 
should fail, then the redundancy of the manipulator may 
permit the completion of the task. Although commercial 
manipulators currently are not equipped with the neces- 
sary circuitry to detect failures and apply the brakes to 
any failing joint, the need for such a mechanism is well 
known [12], [13]. If failed joints are locked, then, a sin- 
gle joint failure reduces the number of degrees of freedom 
(DOF) of the system by one, and the new kinematic func- 
tions /() and their inverses differ markedly from the 
original ones. 

In [12] a method is described for designing manipu- 
lators to be fault tolerant with regards to a given point 
to point task. They assume that any joint may fail any- 
where within its entire range of motion. A manipulator 
is said to be fault tolerant with respect to a given set of 
task points x(f,) only if there exist solutions to (2) for ev- 
ery possible failure. With this assumption, the worst case 
typically occurs when a failing joint is folded in on itself. 
In the work described here, failure tolerance is achieved 
by imposing constraints on the motion of all joints prior 
to a failure. By judiciously selecting the specific solution 
from the family of solutions to (2), the worst case need 
not occur. Thus failure tolerance may be achieved with 
less complex manipulator designs, and for manipulators 
not originally designed with failure tolerance in mind. 

An alternative to defining the manipulator’s task as 
a sequence of end-effector positions is to specify the end- 
effector velocity profile. At the velocity level, the kine- 
matic equations relating the joint rates 0 to the end- 
effector’s velocity x are given by 

i=^je (3) 

where J G is the manipulator Jacobian matrix 

which is a function of the manipulator’s configuration. 
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The solution for all joint rates that satisfy the desired end- 
effector velocity can be represented by 

^ = J+ir-h(/- J+J)2 (4) 

where indicates the pseudoinverse, (/ — J"*" J) is the pro- 
jection onto the null space, and z represents an arbitrary 
vector in the joint velocity space [8]. The second term in 
this equation clearly indicates that there is a family of joint 
trajectories that satisfy (3). However, unlike the kinematic 
function /() relating the joint values to the end-effector’s 
position, the Jacobian for the failed system is easily de- 
rived from the original system’s Jacobian by zeroing the 
column of the failed joint. Using this fact it is possible to 
develop an inverse kinematic function which insures that 
the manipulator will have some degree of local dexterity 
after an arbitrary joint failure [7]. The measure of dex- 
terity in this case is defined as the smallest singular value 
of the Jacobian, so that a kinematic failure tolerance 
measure, ib/m, is given by 

kfm{0)= min <Tm(^ J) (5) 

/=l-n 

where ^ J is the manipulator Jacobian matrix for the sys- 
tem with its /’th joint locked. Having a large value for 
kfm{0) insures that after an arbitrary joint failure the 
manipulator will still be able to satisfy an arbitrary de- 
sired end-effector velocity in the vicinity of the failure. 
Unfortunately, this measure is inherently local in nature 
and can not guarantee that the complete trajectory re- 
mains feasible after the failure. However, it will be shown 
that the local failure tolerance measure, kfm(0)^ can be 
used to guide the search for regions within the workspace 
for which one can insure that the entire desired task can 
be completed regardless of joint failures. 

The remainder of this paper is organized as follows. 
First, a method for analyzing the fault tolerance of a given 
location in the workspace is discussed. Second, the con- 
straints necessary to guarantee fault tolerance for a single 
point are described. Third, a procedure that uses the local 
measure of fault tolerance to identify candidate regions of 
the workspace where critical task should be placed is dis- 
cussed. Then, a method for determining the constraints 
necessary to guarantee the fault tolerance of the manipu- 
lator with respect to the given critical path is outlined. 

II. Surfaces of Self-Motion 

For a kinematically redundant manipulator the fam- 
ily of joint configurations satisfying (1) forms an (n — m)- 
dimensional hyper-surface in the n-dimensional configura- 
tion space of the manipulator [1],[6]. Joint motion con- 
strained to this hyper-surface does not affect the posi- 
tion/orientation of the end effector so that these hyper- 
surfaces are frequently referred to as self-motion mani- 
folds. The null space of the manipulator’s Jacobian given 
by the set of vectors satisfying (3) with x = 0 defines the 
tangent plane to the self-motion manifold. As a simple 
example, consider the 3 DOF planar manipulator shown 



Fig. 1 . A three degree of freedom planar manipulator with equal 
link lengths. 


3DOF Planar Null-Curves 
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Fig. 2. The set of joint conAgurations having the manipulator’s end- 
effector at a single location form curves in the configuration space of 
the manipulator. These curves are the self-motion surfaces for a 3 
link planar manipulator. The self-motion surfaces for some regions 
of the workspace are markedly larger than others. Points with large 
self-motion surfaces tend to be more failure tolerant. 

in Fig. 1 for which the self-motion manifolds are one- 
dimensional curves. For this manipulator a projection of 
the self-motion curves onto the $2 — ^3 plane is shown in 
Fig. 2. Each curve represents the family of joint variable 
combinations which place the end-effector at a constant ra- 
dius from the base. From the figure, it is clear that some 
regions of the workspace have larger self-motion manifolds 
than others. The two extremes occur at the boundary 
of the manipulator’s workspace, which corresponds to the 
point at the origin of the configuration space, and on the 
circle which is centered at the base and has a radius of 1 
meter. In the first case, the self-motion surface vanishes 
to a point. This fact indicates that the manipulator will 
not be capable of reaching the original boundary after any 
joint failure. At the other extreme, the self-motion curve 
spans the entire range of joint values, even in which 
is not shown. This fact is significant since regardless of 
which joint fails, or where it fails, the manipulator will 
always be capable of tracing out the unit circle with it’s 
end-effector. It is interesting to note, that the local failure 
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tolerance measure, (5), reaches its exact theoretically opti- 
mal value on the self-motion surface of this globally failure 
tolerant point. Also note that kfm{0) = 0 at the reach 
singularity. These attributes lead to the use of kfm as a 
first pass when evaluating the workspace in order to place 
critical task. Clearly, when a joint fails and is locked, the 
manipulator is more likely to be able to reach points with 
large self-motion surfaces than those with small ones. 

To guarantee that a manipulator is able to return to a 
desired workspace location, one must, in general, constrain 
the motion range for each of the n joints. The minimum 
and maximum joint values of the ith joint, denoted 
and respectively, can be determined from the mini- 

mum and maximum values of di over the entire self-motion 
manifold. This effectively superscribes an n-dimensional 
box aligned with the joint axes around the self-motion 
manifold. The size of this bounding box is an indication 
of the inherent failure tolerance of the workspace point 
for which it was computed. If the manipulator fails while 
operating within the bounding box of a given desired end- 
effector location x, then it will always be able to posi- 
tion its end-effector at that point regardless of where the 
end effector is located when the failure occurs. For exam- 
ple, consider again the 3DOF manipulator, for which the 
bounding boxes associated with the self-motion surfaces 
for the three workspace points labeled A, By and C in 
Fig. 1 have been drawn in Fig. 2. Note, that although 0i 
and it^s associated boundaries are not shown, they need 
to be considered. If the manipulator fails while within the 
boundary of any one of the bounding boxes, then the ma- 
nipulator will always be able to position its end-effector at 
those points regardless of which joint fails. The region of 
the configuration space which lies inside all three bounding 
boxes is of particular interest. If the manipulator operates 
within this region, then regardless of which joint fails, it 
will be able to reach all three points. Unfortunately, it is 
not possible to reach points B and C and stay within this 
region of the joint space, however, it should be clear that 
obtaining the bounding region for self-motion manifolds 
reveals the potential failure tolerance of various locations 
within the workspace. 

Several iterative methods exist in the literature 
for characterizing one dimensional self-motion curves 
[1],[4],[5],[9]. For systems with two or more degrees of 
redundancy an estimate of the size of the self-surface may 
be obtained by using a Jacobian iteration of the form 


e = ±{I- J+ J)e^ -h J+(x" - x) (6) 

where Cj is a unit vector along the iih joint axis and x"^ is 
the workspace end-effector location being evaluated. The 
first term represents motion along the self-motion manifold 
until the tangent to the manifold becomes orthogonal to 
the joint axis direction e,*. The second term is required 
to compensate for any errors that are accumulated during 
the iterative procedure [5]. This method is effective for 
one-dimensional self-motion curves as in the 3D0F planar 
czise, but may yield an insufficiently low estimate for self- 
motion manifolds of higher dimensions. 


For a two-dimensional self-motion surface, a simple 
and effective method for estimating the bounds of the self- 
motion surface is to iteratively trace out a linearly increas- 
ing spiral on the self-motion surface. Keeping track of the 
values obtained by each joint along the spiral provides an 
estimate of the bounding box containing the self-motion 
surface. A non-escaping spiral, depicted in Fig. 3, has a 
parameterized equation of the form 


^ ^ 

r = j<l> 

where v is the velocity along the spiral, r and (j> are the 
polar coordinates of the spiral, and j controls the distance 
between successive rotations. Since this particular spiral 
passes within a controlled distance from every point in the 
plane, when it is transformed onto the self-motion surface 
it will tend to fill the surface. The iterative transformation 
procedure from parameter to configuration space is given 
by 

6 = sin(<^)ii„_i + cos(^)t)„ + J'^{x* - x) (8) 

where t)n-i and Vn are orthogonal unit vectors that span 
the null-space of the manipulator's Jacobian evaluated at 
the current configuration. The vectors t)„ and t)n_i can be 
computed as the singular vectors from the singular value 
decomposition of J. Since t)„_i and are not unique, 
one must be careful to ensure that vectors chosen are the 
ones nearest to those of the previous iteration. For ex- 
ample, if the current singular vectors are represented by 
Vn-i and Vn then once (8) is evaluated and used to up- 
date the manipulator configuration, the new Jacobian will 
in general have different singular vectors and To 
accurately reflect the continuous rotation of these two vec- 
tors as the null space rotates, one can use the following set 
of equations 


« n-l = -^*"1 + (1 - 

t)„ = (1 — X)wi — Xu>2 


where 


A = 


(w{Vn-ir 
(wj’vn-iy + 


( 10 ) 


where wi and W 2 are any unit vectors that span the new 
null space. Note, that the sign should be examined to 
select the smallest resulting rotation. An ideal algorithm 
for computing the SVD that automatically calculates the 
continuous rotation of the null space is presented in [11]. 
An illustration of this technique for mapping out a two- 
dimensional self-motion surface is presented in Fig. 4. 
This figure shows a three-dimensional projection of the 
five-dimensional configuration space for a PUMA used in 
three-dimensional positioning tasks. 


III. Joint Constraints to Guarantee Fault Tol- 
erance 

As was indicated in the previous section, a workspace 
location, x*, may be guaranteed to be reachable regardless 
of joint failures if the manipulator is constrained to oper- 
ate within the associated self-motion manifold’s bounding 
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Null-Surfaces for Different Workspace Points 



Fig. 3. A linearly increasing spiral p£isses within a controlled 
distance from every point in the plane, and thus it may be used to 
estimate the bounds of a 2D surface in an n-dimensional space. 



Fig. 5. The set of joint configurations for a kinematically redimdant 
manipulator that yield identical end-effector positions is a surface in 
an n-dimensional space. A spiral traced out on the tangent plane 
defined by the null- vectors of the manipulator’s Jacobian reveals 
the shape of the self-motion surface for any given point. Here, two 
distinct points are shown, one having a large self-motion surface, and 
one with a small one as indicated by their bounding boxes. 




box. This is evident since regardless of which joint fails, 
by definition there exists at least one configuration on the 
self-motion manifold associated with x* that corresponds 
to the joint value at which the joint failed. Therefore, 
3D Slice of the Spiral on the Self-Motion Surface the problem of maintaining the fault tolerance of a given 

critical location reduces to that of maintaining joint limits 
specified by the bounding box of the self-motion manifold 



for that location. This problem was first solved in [8] by 
using (4) and selecting z to result in motion away from 
the joint limits. The vector z may be computed by com- 
bining smooth functions so that the joint limits only effect 
the manipulators motion when it is near the constraint 
boundaries [10]. 

To maintain a high degree of fault tolerance, one 
would like to locate critical task points in locations where 
the self-motion manifold bounds are large. For instance 
jigs and fixtures in general should not be placed near the 
workspace boundaries since joint failures will render such 
tegions unreachable. Although the tedious chore of mea- 
suring the size of the self-motion manifolds throughout the 
workspace could be done off-line, it has been found that 
the local measure of fault tolerance, is a good 


indicator of size of the self-motion manifolds. 


Fig. 4. 3D Slice of the spiral traced out on the self-motion surface To insure that a task defined by a sequence of crit- 

in joint space for a PUMA 560 robot used only for positioning. ical points may be performed regardless of joint failures, 

each point must be analyzed, the associated range of its 
self-motion surface determined, and then the intersection 
of the ranges for each point computed to determine the 
required joint constraints (see Fig. 5). Finally, it must be 
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verified that the manipulator is able to reach each critical 
point while maintaining these constraints. 

In summary, the following procedure is used to guar- 
antee the failure tolerance of a redundant manipulator 
with respect to a critical task. First, the workspace is an- 
alyzed using the local failure tolerance measure (5). Sec- 
ond, critical task are placed in regions of the workspace 
that have high values of local failure tolerance. Third, 
the bounding boxes for the self-motion surfaces associated 
with each critical location are determined using the pro- 
cedures outlined in section II. Fourth, the intersection of 
the bounding boxes is calculated to determine the required 
constraints. Fifth, each critical workspace point is checked 
to determine if the manipulator is capable of positioning 
it’s end-effector at the desired location while maintaining 
the constraints imposed by the intersection of all bounding 
boxes. Finally, (4) is used with the joint limit constraints 
to insure the failure tolerance of the manipulator for the 
specified task. 

IV. Conclusions 

This paper has developed a method for insuring the 
failure tolerance of kinematically redundant manipulators. 
In this work, a redundant manipulator is considered to be 
fault tolerant with respect to a given task if it is guaran- 
teed to be capable of performing the task after any one 
of its joints has failed and is locked in place. Methods 
were developed for analyzing the manipulator’s workspace 
to find regions which are inherently suitable for critical 
task due to their relatively high level of failure tolerance. 
Then, the required constraints were imposed on the range 
of motion of the manipulator to guarantee that a given 
task is completable regardless of arbitrary joint failures. 
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Abstract 

The Procedural Reasoning System (PRS) is a gen- 
eral purpose reasoning system that is particularly 
suited for use in domains in which there are prede- 
termined procedures for handling the situations that 
might arise. We have just completed an implemen- 
tation of PRS written in C-h+, which we call the 
University of Michigan Procedural Reasoning System 
(UM-PRS). In this paper, we show how UM-PRS pro- 
vides a critical level of representation for robotic ap- 
plications in unpredictable domains, because it allows 
robotic vehicles to pursue long-term goals by adopt- 
ing pieces of relevant procedures depending on the 
changing context, rather than having to blindly follow 
a prearranged plan. Specifically, UM-PRS has been 
used to control a real outdoor vehicle that changes its 
behavior based on what it sees in its environment. In 
turn, this provides the substrate for coordinating mul- 
tiple robotic vehicles, allowing them to represent joint 
procedures and to infer each others plans through ob- 
servation. 

Introduction 

We have been involved in a project which will, at its 
terminus, result in a team of robotic vehicles that are 
capable of autonomously working together while per- 
forming reconnaissance and other militarily relevant 
tasks. High-level plans for these vehicles will typi- 
cally be in the form of mission plans and annotated 
maps. A mission plan is a declarative representation 
of the vehicles’ major goals. Based on this, a human 
annotates a map showing topographical and strategic 
features to indicate where along planned routes some 
changes in each robotic vehicle’s behaviors must be 
made (turned on, turned off, or parameters modified). 
Thus, the annotated map represents a “program” to 
be executed by the vehicles, where crossing certain 
spatial lines trigger a vehicle to execute the next step 
of its program. 

The annotated map representation® is incomplete 
in that it lacks the richness required for robotic con- 
trol in unpredictable, dynamic environments. Blindly 
following the preprogrammed sequence of behaviors 
might be hazardous (if the context in which the se- 
quence was formed changes) or impossible (if the ve- 

This research was sponsored in part by ARPA under 
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hide loses track of its position on the map or if its 
control is temporarily taken over by a human) . There- 
fore, it is important that the map and mission plan be 
accompanied by more general knowledge about why 
certain actions are being taken and in what context. 
Military doctrine, as laid out in documents such as 
field manuals, contains large numbers of standard op- 
erating procedures (SOPs) that should be selectively 
invoked as conditions and objectives change. In fact, 
an annotated map for a mission plan typically repre- 
sents a particular sequence of SOPs that are expected 
to be useful. For a flexible, autonomous system to 
succeed, however, the suite of SOPs must be avail- 
able to the system at runtime. 

The Procedural Reasoning System"*^ is a general 
purpose reasoning system particularly suited for use 
in domains in which there are predetermined proce- 
dures for handling the situations that might arise. 
This makes it very applicable in domains such as 
that discussed above. However, until recently, there 
has not been an implementation of PRS that is freely 
available for use. We have just completed an imple- 
mentation of PRS written in C-1-+, which we call the 
University of Michigan Procedural Reasoning System 
(UM-PRS). 

In this paper we discuss the details of our initial im- 
plementation. Our implementation is novel in terms 
of both knowledge representations and control struc- 
tures all written in C-f- h to meet the needs of efficient 
real-time robotic control. First, we briefly introduce 
the general concepts of procedural reasoning systems, 
the specific representations and the interpreter of the 
initial UM-PRS version. Second, we illustrate how 
UM-PRS serves as an important intermediate level of 
representation and reasoning between the high-level 
mission plan and the executable annotated map, and 
how it can interface with these levels. We also il- 
lustrate how the flexibility provided by UM-PRS al- 
lows autonomous responses beyond those permitted 
by annotated maps alone. Third, we briefly describe 
how UM-PRS has been used, to date, in the dynamic 
control of a real outdoor vehicle and how UM-PRS 
provides a basis for multi-vehicle coordination, both 
in terms of allowing the automated formation of be- 
lief networks that a vehicle can use to infer the plans 
of others through observation, and in terms of rep- 
resenting the collective activities of vehicles and the 
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roles that they can play. Finally, we summarize the 
current status of UM-PRS and the ongoing improve- 
ments that we are making in order to realize the full, 
flexible autonomous capabilities in our multivehicle 
system. 

Procedural Reasoning System 

Developing reasoning systems that can reason and 
plan in continuously changing environments in real- 
time is emerging as an important area of application 
of Artificial Intelligence. In this section, we describe 
basic features of the Procedural Reasoning System 
(PRS) that motivated us to adopt PRS as a concep- 
tual framework for our system. 

The Procedural Reasoning System"*^ is a general- 
purpose reasoning system, integrating traditional 
goal-directed reasoning and reactive behavior. Be- 
cause most traditional deliberative planning systems 
formulate an entire course of action before starting 
execution of a plan, these systems are brittle to the 
extent that features of the world or consequences of 
actions might be uncertain. In contrast, the Procedu- 
ral Reasoning System continuously tests its decisions 
(both high- and low-level) against its changing knowl- 
edge about the world, and can redirect the choices 
of actions dynamically while remaining purposeful to 
the extent of the unexpected changes to the environ- 
ment. 

PRS thus is not a planning system in the traditional 
AI sense, in that PRS does not concentrate on search- 
ing for sequences of primitive actions that lead to spe- 
cific goals. Instead, PRS is a plan execution system: 
it assumes that it already has “plans” (procedures) 
for achieving various goals in various contexts; how- 
ever, it might string together actions in unexpected 
ways as it dynamically chooses among procedures and 
subprocedures in a changing environment. 

Typically, accomplishing a mission in a military set- 
ting is much more similar to the plan execution ac- 
tivities of PRS than to the activities of traditional 
planning systems. Procedures for military tasks such 
as reconnaissance are developed and learned off-line,^ 
and training involves mastering the selection and ex- 
ecution of predefined procedures. These procedures 
may contain knowledge about both cognitive (such as 
situation assessment) and physical actions, and they 
can be arbitrarily complex. Often, a “step” in one 
procedure (such as “move to assembly area”) might 
itself correspond to several sub-procedures, each ap- 
propriate in different contexts. 

PRS is conceptually geared for representing pre- 
cisely this kind of procedural information. Several 
features that make PRS particularly powerful as a 
situated reasoning system^ are as follows: 

• The semantics of its plan (procedure) representa- 
tion, which is important for verification and main- 
tenance. 

• Its ability to expand and act on partial plans. 

• Its ability to pursue goal-directed tasks while be- 
ing responsive to changing patterns of events in 
bounded time. 



Figure 1: PRS System Structure^ 


• Its facilities for managing multiple tasks in real 
time. 

• Its default mechanisms for handling the environ- 
ment’s stringent real-time demands. 

• Its metalevel (or reflexive) reasoning capabilities. 

PRS consists of (1) a database (called World Model) 
containing current beliefs or facts about the world; 
(2) a set of current goals to be realized; (3) a set of 
plans (called Knowledge Areas) describing how cer- 
tain sequences of actions and tests may be performed 
to achieve given goals or to react to particular situ- 
ations; and (4) an intention structure containing those 
plans that have been chosen for eventual execution 
(Figure 1). An interpreter (or reasoning mechanism) 
manipulates these components, selecting appropriate 
plans based on the system’s beliefs and goals, placing 
those selected on the intention structure, and execut- 
ing them.^ 

The system interacts with its environment, includ- 
ing other systems, through its database (which ac- 
quires new beliefs in response to changes in the envi- 
ronment) and through the actions that it performs as 
it carries out its intentions.^ 

UM-PRS 

In our particular implementation of PRS, we have 
been concerned to a large extent with the specific re- 
quirements of the military task domain we have out- 
lined, and which we will discuss further below. Thus, 
whereas some versions of PRS have been developed 
(typically in Lisp) to be extremely general, our goal 
has been to identify the crucial features of PRS for 
our application domain, and to implement them in a 
robust way in C-l-d-. Some of the design and imple- 
mentation decisions that we have made follow. 

World Model 

In our implementation, WM is a database of facts 
which are represented as relations. A relation has a 
name and a variable number of fields. Initial facts 
are asserted at the beginning of a UM-PRS program 
by the user, and other facts can be either asserted or 
retracted by the KAs, which will be explained below. 
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Knowledge Areas 

A Knowledge Area (KA) is a declarative procedure 
specification of how to satisfy a system goal or query. 
It consists of the purpose (a goal, query, test, or world 
model assertion or retraction) for executing the KA, 
the context in which the KA is applicable, a graph- 
ical network called the body which specifies what is 
required to satisfy the purpose in terms of primitive 
functions, subgoals, conditional branches, etc., and a 
symbol table which holds values for variables when a 
KA is instantiated for a specific situation. The con- 
text consists of a mixed sequence of patterns to be 
matched against the WM and expressions to be satis- 
fied using the variable bindings generated during the 
matching. 

A SOAK (Set Of Applicable KAs) is a collection of 
KAs which have been instantiated to achieve a goal 
(purpose) that has just been activated. Each KA in 
the SOAK is applicable to the specific situation, as 
one role of the context is to filter out KAs that are 
not relevant to a particular situation. 

The body describes the procedure’s steps, consist- 
ing of a network of actions. The body can be viewed 
as a plan schema. The schema is instantiated with 
the bindings which are generated when the purpose 
and the context of the KA are checked during SOAK 
generation. 

Actions 

Actions are the arcs in a KA body that consti- 
tute either primitive actions or subgoals to achieve. 
A “primitive” action is a behavior or activity that 
can be executed directly. Any other type of action 
represents a) a goal that needs achievement, mainte- 
nance, or to be waited upon b) a query, c) a test, or d) 
an assertion or retraction of world model information. 
Actions are represented by a base class that holds in- 
formation regarding the KA in which the action is 
found and the action name. Each of the types of ac- 
tions are represented by a derived class that maintains 
such information as function pointers (for a primitive 
action), the expression to evaluate (for a test), a goal 
or query expression, or a world model relation to as- 
sert or retract. 

Goals 

Goals in UM-PRS are the world states that the 
system is trying to bring about (or maintain, etc.). 
A goal can be either a top-level goal, which controls 
the system’s highest order behavior, or a subgoal ac- 
tivated by the execution of a KA arc. 

Intention Structure 

The intention structure acts as the run-time stack 
for the system. It keeps track of the progress of each 
high-level goal and all of the subgoals. The intention 
structure suspends, resumes, cancels, and proceeds 
with execution of goals in much the same way as an 
operating system. The intention structure maintains 
information about what KAs are currently active, as 
well as what actions in each KA are to be executed 
next. As there are conditional branches in a KA, the 


intention structure must also maintain information re- 
garding the success or failure of branches. 

The Interpreter 

The UM-PRS interpreter is similar to the inter- 
preter described for PRS: It is what controls the exe- 
cution of the entire system. Whenever there is new or 
changed information in the world model or goal list, 
the interpreter determines a new SOAK. From this 
SOAK is selected the most appropriate KA, which is 
placed in the intention structure. When there are 
no SOAKs being generated, the interpreter checks 
the intention structure for the currently active KAs 
and executes the next primitive action. If this ac- 
tion changes the goal list (by creating a subgoal or 
by satisfying a goal) or world model, a new SOAK is 
created and the cycle starts over. If a new SOAK is 
not created, then the next arc in a leaf-level KA is 
executed. 

With this implementation, the interpreter facili- 
tates switching to more important goals according to 
the situation. This implementation also stays com- 
mitted to one method of achieving a goal by not recon- 
sidering alternatives unless the current method fails. 
The UM-PRS interpreter is different from the PRS in- 
terpreter in that there is currently no metalevel con- 
trol, although we plan on adding that in the near 
future as we enrich the set of KAs such that we could 
have many KAs applicable in overlapping situations. 

Example 

We began by briefly describing the incomplete- 
ness of the annotated map representation^ in unpre- 
dictable, dynamic environments. In this section, we 
show examples of using both the annotated map rep- 
resentation and the UM-PRS system. We first show a 
clear correspondence between the annotated map rep- 
resentation and the UM-PRS representation when ge- 
ographical events are the main triggers of the actions. 
In the second example, we show the limitation of the 
annotated map representation and, in contrast, the 
richness of UM-PRS when general (non-geographical) 
events trigger actions. 

Figure 2 is an example annotated map representa- 
tion of a simple scenario of getting to an observation 
point from the assembly area using road following and 
STRIPE (a waypoint following method) alternatively. 
The actions to be taken are annotated along the cir- 
cles in the map. Figure 3 is an example UM-PRS 
knowledge area that generalizes the procedure on the 
annotated map. The actions in the KA body roughly 
correspond to the sequence of actions represented in 
the annotated map, and the KA representation (con- 
text and body) is somewhat simplified to highlight 
the correspondence between the two representations. 
A full detailed working example is presented in the 
Appendix. 

An important advantage to using the procedural 
representation over the annotated map representation 
is that the correspondence between mission objectives 
and map markings is made explicit. That is, with an 
annotated map, the (human) mission planner has a 
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NAME: "Get to OP" 

PURPOSE: (ACHIEVE get_to_OP $op) 
CONTEXT: (FACT assembled "True") 
BODY: 

o 

(EXECuW Depart) 


o- 

(QUERY Ari|ived_OP $value) 


o 


(ACHIEVE 


,Follow_Road) 


o 


(TEST (== (TESt^*;^vaIue "False")) 

o b 

(ACHIEVE RSTA) (ACHIEVE STRIPE) / 


o 


o 


Figure 3: UM-PRS KA 


mission in mind, along with an operating procedure to 
accomplish it. The overall operating procedure is not 
explicitly represented anywhere, but only the steps 
are given in the annotations. However, the human 
interface using a UM-PRS-based system can be quite 
different. The human can specify an objective, and 
UM-PRS will retrieve appropriate KAs. In order to 
instantiate those KAs, UM-PRS must bind variables 
to values, and those values could include geographi- 
cal information (where the assembly area is, or which 
road to follow). Thus, while the user will certainly 
still point to locations and regions on a map, he or 
she will do so in response to the needs of the explicitly- 
represented, standard operating procedures currently 
being elaborated by UM-PRS. 

The second example scenario is shown in Figure 4. 
The vehicle starts out by issuing a command to start 
the road-following behavior. This moves the vehicle 
forward until it reaches a cone or goes past a maxi- 
mum allowed distance. If it passes the max distance 
without seeing the cone, the vehicle stops and the 
demo is done. If the vehicle detects the cone, it ap- 
proaches the cone and starts off-road behavior until 
it sees that it has reached the end point. When the 
vehicle has reached the end point, the demo is done. 

The idea of the demo is to show that UM-PRS can 
be used to represent conditional actions based on non- 
geographical events such as cone detection. The cone 
can be placed anywhere along the road, or there may 
be no cone at all. Such non-geographical events are 
hard to annotate in a map. The full KA description 
of this demo is presented in the Appendix. Note that 


the general capability of pattern-directed invocation 
of UM-PRS makes it possible to represent high-level 
choices of very different behaviors as well as simple 
non-geometric events. So, for example, representing 
procedures for context changes such as “running for 
cover” after having “been discovered by enemy” is 
easy to represent in UM-PRS, rather than cluttering 
up regions of the map with annotations. 

Experiments 

In this paper, we have described our implemen- 
tation of UM-PRS. Currently, we are experimenting 
with using UM-PRS for robotic control of two indoor 
mobile robots, and also of an outdoor robotic vehicle 
(Figure 5). The scenario being used for our experi- 
ments is a reconnaissance task in a military domain. 
Typically, the means of satisfying the task’s goals are 
described in terms of procedures to follow in specific 
situations. These procedures correspond exactly to 
KAs, with the context and the purpose specific to the 
situation in which it is applicable. 

One thing we have to note is that all the actions de- 
scribed in the KA body should eventually map down 
to primitive actions (C or C-h~h functions) which are 
directly executable on the real robot or vehicle. Since 
our implementation is done in C-h-h, the interface be- 
tween KA actions and real primitive control functions 
is very natural and efficient. 

UM-PRS: Toward Multi- vehicle Coordination 

Many tasks of interest, particularly in the military 
domain, require the concerted efforts of several play- 
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ers. In other words, they require teamwork. Recently, 
Georgeff ’s colleagues^ have explored extensions to the 
procedural representation to model different “roles” 
played in team procedures, and have developed al- 
gorithms for assigning roles to potential team mem- 
bers. These capabilities need to be incorporated into 
UM-PRS to capture the notion of roles in military 
procedures, such as the roles of “bounder” and “over- 
watcher” in a bounding-overwatch procedure. Since, 
in such a procedure, the vehicles take turns watch- 
ing and moving as they leapfrog across an area, the 
roles will be assigned and reassigned dynamically in 
the course of the procedure. 

Having each adopted its role in a shared, team pro- 
cedure, a vehicle can then work to achieve its goals 
in the procedure. However, since how it chooses to 
achieve its goals can impact the choices available to 
other vehicles in how they achieve their goals, some 
additional coordination is often needed. In essence, 
while the vehicles might commit to a team plan at an 
abstract level, they also might have to commit ahead 
of time to how they might (or might not) elaborate 
their plans into detailed actions. Anticipating and 
making necessary commitments ahead of time (be- 
fore they move into the field where communication 
is more risky and error prone) should be done, but 
overcommitment should be avoided lest the vehicles 
commit to specific courses of action that they later 
find to be suboptimal or even ineffective. Thus, while 
the conceptual framework of PRS emphasizes delayed 
commitment to action until the action must be taken, 
coordination requires some degree of commitment to 
the future. Extending UM-PRS to provide this capa- 
bility is one of our ongoing efforts. 

Also, once in the field, some coordination might be 
needed, and is typically done through communicating 
about plans, goals, or beliefs about the world. Each 
of the involved vehicles can reason about this infor- 
mation in order to detect possible conflicts, improve- 
ments, synergies, etc. As mentioned before, explicit 
communication may not always be possible, however, 
due to such situations as hostile vehicles in the vicin- 
ity, environmental noise, and broken equipment. Plan 


recognition is the process of inferring the same infor- 
mation (the motivating goals or beliefs of a vehicle) 
based upon sensory observation of that vehicle’s ac- 
tions. Once the plans have been inferred, the same 
reasoning process can be used to make decisions re- 
garding coordination. 

One of the most significant issues that arises is that 
plans of the robotic vehicles will be in the form of 
UM-PRS Knowledge Areas, which are not conducive 
to performing plan recognition. In response to this, 
we are developing a system that will automatically 
convert the plans of the other team vehicles into a 
representation amenable to plan recognition, namely 
belief networks. Belief networks, also called Bayesian 
networks,^ provide the framework and mechanisms for 
performing probabilistic reasoning about the relation- 
ship between the observations of a vehicle’s actions 
and the vehicle’s reasons (plans, goals, etc.) for per- 
forming that action. While manual construction of 
belief networks to represent the UM-PRS plans can 
be done, it is extremely time consuming and subject 
to variability. When considering the large number 
of possible KAs for complex tasks, and hence the 
large number of possible plans that might be cre- 
ated and executed, manual conversion of the plans 
becomes impractical. By providing an automated sys- 
tem, this process can be performed quickly, efficiently, 
and without variation. 

Conclusions 

The representation and control scheme that we 
have implemented has proven to be sufficiently pow- 
erful for planning and execution in procedurally rich 
domains, specifically in a robotic reconnaissance task. 
Our experiments have shown the initial implemen- 
tation to reactively switch between goals, while re- 
maining committed to the current method of achiev- 
ing each goal. We are actively working on many ex- 
tensions to the initial implementation (such as met- 
alevel control, and coordination mechanisms, as out- 
lined above) that will make it even more powerful and 
flexible. 
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Appendix: Example system 

The example system included here as a demo for 
UM-PRS is an early version of a KA library and prim- 
itive functions developed to demonstrate the applica- 
bility of UM-PRS to mobile robot tasks. 

The idea of the demo is to show that a planner can 
be used as a triggering device to enable and change 
vehicle behaviors. 

• The vehicle starts out by issuing a command to 
start the YARF road following behavior. This 
moves the vehicle forward until it reaches a cone 
or goes past a maximum allowed distance. 

• UM-PRS will wait until the vehicle is stopped (0 
= not stopped, 1 = stopped). UM-PRS then will 
query to see if the vehicle has seen the cone or 
passed the max distance. 

• If it passes the max distance, then the vehicle stops 
and the demo is done. 

• If the vehicle detects the cone, then the vehicle 
stops and waits for the next behavior. 

• If the vehicle sees the cone, then UM-PRS will issue 
an Approach Cone behavior. UM-PRS will then 
wait until the vehicle has stopped and check if it 
has reached the cone, 

• If the vehicle has reached the cone, then UM-PRS 
will issue an Off Road behavior. UM-PRS will wait 
until the vehicle has stopped and reached the end 
point. 

• When the vehicle has reached the end point, then 
the demo is done. 

KA code 

GOALS: 

(ACHIEVE cone_demo) 

FACTS : 

(vehicle_status “True") 

(demo^done "False") 

(cone_found "False") 

(cone_reached "False") 
(vehicle_reached "False") 
(vehicle_maxdist "False") 

(vehicle_s topped "True") 


// 

// KA 1 

// 

KA { 

NAME: 

"complete cone demo" 

DOCUMENTATION : 

"This is the main KA 
that will start the cone demo" 

PURPOSE : 

(ACHIEVE cone^demo) 

CONTEXT : 

(FACT vehicle_status "True") 


( FACT demo_done "False") 

BODY: 

(1 (ACHIEVE vehicle_initialized) 2) 

(2 (ACHIEVE road_scouted) 3) 

} 

// 

// KA 2 

// 

KA { 

NAME: 

"initialized vehicle" 

DOCUMENTATION : 

"initialized all the vehicle controls" 
PURPOSE : 

(ACHIEVE vehicle_initialized) 

CONTEXT : 

(FACT vehicle_status "True") 

( FACT demo_done "False") 


BODY: 

(99 

(1 

(EXECUTE ini t_dat abase 
(EXECUTE home_robot $x 

2) 1) 

$y $to) 2) 

(2 

(ASSERT 

vehicle_ini tali zed) 

3) 

(3 

(ASSERT 

YARF 

2) 

4) 

(4 

(ASSERT 

APPROACHCONE 

4) 

5) 

(5 

(ASSERT 

OFFROAD 

8) 

6) 

(6 

(ASSERT 

CHECKVEHICLE 

16) 

7) 

(7 

(ASSERT 

STOPPED 

2) 

8) 

(8 

(ASSERT 

CONEFOUND 

4) 

9) 

(9 

(ASSERT 

MAXDIST 

8) 

10) 

(10 

(ASSERT 

READCHEDCONE 

16) 

11) 

(11 

(ASSERT 

READCHEDVEHICLE 32) 

12) 

(12 

(ASSERT 

VEHICLESTATUS 

64) 

13) 


H 

// KA 3 

// 

KA ( 

NAME: 

"road scouted" 

DOCUMENTATION: 

"This KA will scout out a road, 
as per the scenario plan" 

PURPOSE : 

(ACHIEVE road_scouted) 

CONTEXT : 

(FACT vehicle_status "True") 

( FACT demo_done "False") 

BODY: 

(1 (ACHIEVE road_f ollowed_until_cone) 2) 
(OR 

((2 (FACT cone_found $value) 3) 

(3 (TEST (== $value "True")) 5) 

(5 (ACHIEVE cone_approached) 6) 

(6 (FACT cone__reached $value) 7) 

(7 (TEST (== $value "True")) 8) 

(8 (ACHIEVE traveled_off_road) 9) 
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(9 (FACT vehicle_reached $value) 10) 
(10 (TEST (== $value "True")) 11) 

(11 (ASSERT demo_done "True-^) 12)) 

( (2 (FACT vehicle_maxdist $ value) 20) 
(20 (TEST (== $value "True”)) 21) 

(21 (ASSERT demo^done "True") 12))) 

} 

// 

// KA 4 

// 

KA { 

NAME: 

" road_f ollowed_until_cone” 

DOCUMENTATION: 

"This KA will follow the road 
until the vehicle sees a cone 
or passes the turn off point" 

PURPOSE : 

( ACHIEVE r oad_f o 1 lowed_unt i l_cone ) 
CONTEXT : 

(FACT vehicle_status "True") 

(FACT cone_found "False") 

(FACT vehicle_maxdist "False") 

(FACT YARF $YARF) 

(FACT STOPPED $ STOPPED) 

(FACT CONEFOUND $CONEFOUND) 

(FACT MAXDIST $MAXDIST) 

BODY: 

(1 (EXECUTE start_behavior $YARF) 2) 

(2 (EXECUTE check_behavior 
$ STOPPED 

$vehicle_stopped) 3) 

(3 (FACT vehicle_s topped $value) 4) 

(OR 

((4 (TEST (== $value "False")) LOOP 2)) 
((4 (TEST (== $ value "True")) 5) 

(5 (EXECUTE check_behavior 
$CONEFOUND 
$cone_found) 6) 

(6 (ASSERT cone_found 

$cone_found) 7) 

(7 (EXECUTE check_behavior 
$MAXDIST 

$vehicle_maxdist) 8) 

(8 (ASSERT vehicle_maxdist 

$vehicle_maxdist) 9))) 

/* 

start YARF behavior 
while (not done) 
if (vehicle stopped) 
done = true 

if (vehicle max distance) 
stop all, 

we are done with demo, 
mission was not accomplished 
else if (cone found) 
assert found cone 

*/ 

} 

// 

// KA 5 


// 

KA { 

NAME: 

" cone_approached " 

DOCUMENTATION : 

"When the vehicle sees the cone, 
it approaches it" 

PURPOSE : 

(ACHIEVE cone_appr cached) 

CONTEXT : 

(FACT vehicle_status "True") 

(FACT cone_reached "False") 

(FACT APPROACHCONE $ APPROACHCONE ) 

(FACT STOPPED $ STOPPED) 

(FACT REACHEDCONE $REACHEDCONE) 

BODY: 

(1 (EXECUTE start_behavior 

$ APPROACHCONE) 2) 

(2 (EXECUTE check_behavior 
$ STOPPED 

$vehicle_s topped) 3) 

(3 (FACT vehicle_s topped $value) 4) 

(OR 

((4 (TEST (== $value "False")) LOOP 2)) 
((4 (TEST (== $value "True")) 5) 

(5 (EXECUTE check_behavior 
$REACHEDCONE 
$cone_reached) 6) 

(6 (ASSERT cone_reached 

$cone_reached) 7) ) ) 

start approach cone behavior 
while (not done) 

if (vehicle stopped) 
done = true 
if (reached cone) 
assert at cone 

*/ 

} 

// 

// KA 6 

// 

KA { 

NAME: 

" traveled_of f_road" 

DOCUMENTATION : 

"When the vehicle is at the cone, 
it does some off reading" 

PURPOSE : 

(ACHIEVE traveled_of f_road) 

CONTEXT : 

(FACT vehicle_status "True") 

(FACT vehicle_reached "False") 

(FACT STOPPED $ STOPPED) 

(FACT REACHEDVEHICLE $REACHEDVEHICLE) 
(FACT OFFROAD $ OFFROAD) 

BODY: 

(1 (EXECUTE start_behavior $OFFROAD) 2) 
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(2 (EXECUTE check_behavior 

$STOPPED $vehicle_stopped) 3) 
(3 (FACT vehicle_s topped $value) 4) 

(OR 

((4 (TEST (== $value "False")) LOOP 2)) 
((4 (TEST (== $value "True")) 5) 

(5 (EXECUTE check_behavior 
$REACHEDVEHICLE 
$vehicle_reached) 6) 

(6 (ASSERT vehicle_reached 

$vehicle_reached) 7 ) ) ) 

/* 

start off road behavior 
while (not done) 

I if (vehicle stopped) 

done = true 
if (reached vehicle) 
assert at vehicle 

*/ 

} 

// 

// KA 7 

// 

KA { 

NAME: 

"vehicle_status_checked" 

I DOCUMENTATION : 

! "Check to make sure the vehicle is ok 

I once in a while" 

I 

PURPOSE : 

(ACHIEVE vehicle_status_checked) 

CONTEXT : 

(FACT CHECKVEHICLE $CHECKVEHICLE) 

(FACT VEHICLESTATUS $VEHICLESTATUS) 

BODY: 

(1 (EXECUTE start_behavior 

$CHECKVEHICLE ) 2 ) 

( 2 ( EXECUTE check_behavior 
$VEHICLESTATUS 
$vehicle_s tatus ) 3 ) 

(3 (ASSERT vehicle_status 

$vehicle_status) 4) 

/* 

if (vehicle status != OK) 
stop vehicle, stop prs 

*/ 

} 
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Absliact 

Parallel robotic mechanisms have good accuracy, high 
stiffness and large payload to weight ratio compared 
to the traditional serial mechanism. This paper 
compares two simple constant gain control schemes 
for a parallel robotic mechanism actuated by hydraulic 
cylinders. One of the control schemes which will be 
refoied to as a "rate based schone" uses the position 
and rate information only for feedback. The other 
control scheme referred to as the "force based scheme" 
feeds back the force information also. It is shown 
that for parallel robots with hydraulic actuators the 
response of the end-effector can be improved by using 
the force information from the actuates without 
adding any extra computational burden. The force 
based control scheme can also be easily modified to 
control the forces on the end-effector. The control 
scheme has been implemented in a computer 
simulation and the results are presented in the paper. 

Introduction 

Recently, there has been a lot of activity in the area 
of parallel mechanisms (Figure 1). Parallel 
mechanisms are able to overcome many of the 
shortcomings of serial mechanisms. They have a 
high stiffness, large payload to weight ratio and good 
accuracy compared to smal mechanisms. The best 
known application of parallel mechanism is the 
Stewart Platfmm which is used in aircraft simulators. 
These mechanisms also have potential for application 
in zero^)artial gravity simulators, assembly tasks and 
precision machining. 

Previous research in parallel mechanisms has been 
focused on die kinematics of the mechanisms. The 
general kinematic considoadons are examined in [3], 
[4], [7], [13] and [14]. The direct position kinematics 
of some special parallel mechanisms are given in [S], 
[9] and [10]. There is little research in the dynamics 
and controls of parallel mechanisms. Some of the 
dynamics issues are examined in [IS]. 

An important part of assembly tasks is the control of 
interaction forces between components. In other 
applications such as partial/zero gravity simulators, it 
will be required that the actuators apply constant 
forces through the cento* of mass of the end effector. 
In this application, again the forces on the end 
effector will have to be controlled. The rate based 


scheme does not offer any capability fo controlling 
forces. The force based control scheme developed in 
this paper can easily be modiEed to ctmtrol these 
forces. This ability is crucial to the successful 
application of parallel mechanisms to assembly tasks. 

The parallel mechanism analyzed in this paper is 
shown in Figure 1. It is a six degree of freedom 
mechanism. The top member and the base member 
are connected by six limbs. Each limb is a hydraulic 
cylinder with universal joints at each end. The piston 
and the cylinder are allowed to rotate with respect to 
eachotha*. Thus each Umb is a six degree of fii^om 
serial chain with one actuated prismatic joint. The 
universal joints in the top member are located in a 
plane and the universal joints in the base member are 
also located in a single plane. 

The first section of the p^r examines the linearized 
dynamic model of the fully parallel mechanism. 
Using the linearized model, Ae control schemes are 
studied. The following section describes the conuol 
scheme in detail. The next section deals with the 



Figure 1. Special Parallel Mechanism (Each limb 
consists of universal joints at each end and a 
cylinderical joint between them; the prismatic joint is 
actuated). 


Copyright © American Institute of Aeronautics and 
Astronautics, Inc., 1976. All rights reserved. 
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complete dynamic simulation of the parallel 
mechanism. To simulate the dynamics of the 
mechanism, it is essential to have an efficient scheme 
for the direct dynamics problem. This will help in 
reducing the computational time required for 
simulation. The models for the hydraulic cylinders 
and the servolvalves are given in Appendix B. The 
results are discussed in the last section. 


Linearized Rigid Body Nfodel 

In fully parallel mechanisms ([14]), the joint rates and 
the end effector motion are relat^ by the following 
equation: 


L = H 


T 



( 1 ) 


where 

L= Vector ctxisisdng of the joint rates. 

H = 6x6 matrix. 

(0 = angular velocity of the end effector. 

p. s velocity of the point on the end effector 

coincident with the origin. 

The matrix H is a purely geometric quantity. The 
columns of the matrix H are the wrench axis of the 
joints. 

For parallel mechanisms, static force decomposition 
equation is: 


R 

r 


= HF 


( 2 ) 


where 

R = Forces acting on the end effector. 

r = Moments acting on the end effector about the 

origin. 

H = 6x6 matrix 

F = Forces/Twque’s at the joints. 


It will be assumed without any loss of generality that 
the origin of the fixed coordinate system is located at 
the nominal position of the cento' of mass of the end- 
effector. 


The current approach, based on the rate control 
scheme, is to control the joint lengths in the parallel 
mechanism. Using this scheme, current position of 
the end effector is compared to the desired position. 
The error in the position is convoted to an error in 
the length of the limbs. The limbs are then 
individually controlled to eliminate this oror. This is 
achieved by applying forces in the limbs that are 


propmtional to the error in the limb lengths. This is 
given by the equation: 

F = -KpL-KyL (3) 


where 

L = etnx' in the limb lengths. 

The equation of motion of the end effecUM’ is given by 
(ignoring the non-linear toms): 



where 

M s Diagonal matrix with the mass of the end 
effector as the diagonal term. 

J s Moment of inertia matrix at the current position, 
p, 0 = small errors in position and orientation of ^d 
effector. 


Using equations (2), (3) and (4), the response equation 
is (ignoring the actuator model): 



-i-HKvH'*' 


P 

0 


+ HKpH^ 


P 

e 


= 0 


(5) 


The gain matrices in the above equation, Kp and Ky 
are diagonal matrices with constant terms (the 
cylind^ are controlled independently with constant 
gains). Clearly, the response depends on the matrix 
H, wMch depends on the position of the end effector. 
This is undesirable. It is also not obvious from the 
above equation, whether an increase in the gains 
would lead to an improvement in the response of the 
mechanism. Increase in the gains might lead to 
deterioration in the response due to the structure of 
the H matrix. Further, increasing the gains, leads to 
a shift in all the closed loop poles of the above 
system. The fast poles as well as the slow poles are 
moved simultaneously. This leads to a marginal 
improvement in the response of the system with large 
changes in the gains. There is also the possibility of 
exciting the higher order dynamics due to the presence 
of fast poles (as a result of increase in gains). 

As opposed to the above rate based sch^e, in the 
force based control scheme, the force is controlled in 
each limb. The error in the current position of the 
end effectCH’ is fed back as error in the forces exerted 
by the limbs. This is given by the equation: 


■R' 

= -K„ 

P 

-K„ 

P' 

r 

p 

_e_ 

V 

0 
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The force to be produced by the actuators at the limbs 
is given by (equation (2) and (6)): 




+ K, 


( 7 ) 


The response equation is now given by (combining 
equations (4) and ( 6 )): 


The above scheme can also be easily adapted to 
cmitrol the forces on the end effector. The direction 
in which fences are to be controlled can easily rq>lace 
the position errOTS in equation (6). This would 
effectively control the forces in those directions. 

Dynamic Simulation and Control Scheme of the 
Parallel Mechanism 



In a simple implementation of the above scheme, Kp 
and Kv can be chosen as diagonal matrices with fixed 
diagonal terms. The response equations are now to a 
large extent position independent and decoupled. The 
response does not depend on the H matrix. The 
coupling and the position dependency in the above 
equation will come from the J matrix. This response 
equation is an improvement over the previous 
Kheme. The closed loop poles can be assigned 
independently in the above scheme and the response 
improved without a corresponding large increase in 
the gains. 


A rigid body dynamic model was developed for the 
parallel mechanism for simulation in MATRIXX. 
Some of the assumptions made for the dynamic 
model are as follows. The limbs were assumed to be 
axisymmetric. The friction losses in the spherical 
and revolute joints were assumed to be negligible. 
The prismatic joint in the cylinder was assumed to 
have viscous losses and the damping coefflcirat was 
adjusted to obtain a response that closely 
approximated the response from the actual 
mechanism. The model for the hydraulic cylinders 
consists of a leakage term which is assum^ to be 
propcHtional to the pressure difference in the cylind^ 
(assuming that the leakage flow is laminar). The 
model for the servovalves was taken from [8]. 



Figure 2. Block Diagram for Rate Based Control Scheme. 
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Figure 3. Block Diagram for Force Based Cbntrol Scheme. 
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The block diagram fw the rate based control scheme 
is shown in Figure 2. In this control scheme, the 
hydraulic cylinders were controlled by dual stage 
servovalves. Within the servovalve, the spool 
position is fed back to the electromagnet with a 
spring. The block diagram for the servolvalve is 
given in Appendix B. The oror in the position and 
the velocity are fed back to the cunent in the 
servovalve. The current in the servovalve is given 
by: 

i = -kpleL-kvleL (9) 

where 

i = 6x1 vector, 
kp, kv - constants. 

I = 6x6 identity matrix. 

CL = error in limb lengths. 

6]^ = error in limb length rates. 

A decoupled controller with constant gains is chosen 
for the controller because the gains are associated with 
the hydraulic cylinders. Since all the hydraulic 
cylinders are identical, the gains are kept the same for 
all the cylinders. 

The block diagram for the force based control scheme 
is shown in Fig. 3. In this control scheme, the 
cylinder woe controlled by a proporticxial servovalve. 
It was found that the spool position feedback in the 
dual stage servovalve inhibited the response of the 
mechanism. The maximum spool opening from the 
servovalve was the same in both the valves to ensure 
that a fair comparison could be made in both the 
schemes. 


The desired force in each of the limbs is given by: 



whoe 

Kp, Kv = Diagonal constant matrices. 

Cp = error in position. 

Cp * error in velocity 

ee = error in angular position. 

= error in angular velocity. 

The gain matrices Kp and Kv are chosen to be 
diagonal matrices. At the current location of the end- 
effector, the off diagonal terms could have been 
chosen to decouple the system. This decoupling 
however, would not be possible for the complete 
woricspace due to the change in the inertia matrix. 


The error in the desired ftxce in the limbs given by 
equation (10) and the achial force in the cylindo' is fed 
back to the spool. The position of the spool is then 
given by: 

X,=kfef (11) 

where 

kf= constant 

ef = 6x1 oror vectOT in the ftaces. 

The gain for the force controUa are kept the same for 
all the cylindos, since the cylind^ are idratical. 

Rigid Body D ynamic Model 



Figure 4. Parallel mechanism with a single limb and 
the variables. 

The six degree of freedom parallel mechanism shown 
in Figure 1 consists of 13 rigid body elements. The 
six equations of motion can be obtained for each 
element, resulting in a large matrix (in this case 
78x78 matrix) equation. This equation will be 
computationally expensive to solve. In the following 
section, a compact 6x6 matrix equation for the 
dynamics of the parallel mechanism is obtained. This 
would result in substantial savings in the 
computational time for the simulation. 

The convention followed in the test of the paper is as 
follows. All vectors and matrices are in bold letters. 
The siq>erscript on the top left hand comer of a vector 
denotes the coordinate system. A superscript "t" 
refers to the fixed coordinate system coincident with 
the coordinate system attached to the top memb^ and 
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"li" lefos to the fixed coordinate system coincident to 
the coordinate system attached to the ith limb. No 
supo^pt signifies the fixed worid coordinate system 
attached to the base member. 


The equations of motion of the top member are given 
by: 

M‘a = -X‘Fi3+M‘g 
i 

J*a = -5:(‘qix‘Fi3+‘Ti3}f*0)Xj‘a) (12) 

where 


m 

0 

O' 

0 

m 

0 

.0 

0 

m 

h 

- 

-J*y 

-J*y 



-J*z 


-Jyz 


V*b[o 0 -gf 

g == accel^ation due to gravity. 

^Fi 3 = force applied by the top member to the ith 
limb. 

^Ti 3 = Torque £q)plied by top member to the ith limb, 
^i = position of the ith force. 

ko = angular velocity of the top member in the top 
coordinate firame. 


In the above equations, the forces are unknown. 
These forces are applied to the top member by the 
limbs. The additional equations will come from the 
dynamic analysis of the limbs. These dynamic 
equations will be in the form: 


*‘ai3='*R,(*a-qX‘a+t) 


(14) 


where 


0 


qX = 


*Qiz 

-‘qix 


qiz qiy 

0 -‘q« 

‘q« 0 


t = 


-qix(‘«>5+‘«>zK®*(*“z*qz+‘®y*qy) 
-*qiy(*Wz+‘o)*)<-‘o>y(‘o>x‘q*+*Wz‘qz) 
-qiz( *0)*+‘ w? y<Oz( ‘Wy ‘qy+‘®z ‘q* ) 


Combining (13) and (14) and rearranging t^rns: 

[I3x3 -qxj‘*j=‘RiD‘<Rt‘Fi3+‘Ri^di-t (15) 


The above constraint equation can be obtained for 
each limb. This will give 6 such equatiems. 

Another set of constraint equations can be obtained 
for Ti 3 . The angular accelerations of the upp^ part 
of the limb and the top member are related by: 


r - 1 


1 

1 

1 

=‘‘R, 

'«y 


(16) 


The top two members of the above vector equation 
are not important and thus are left blank. This 
equation can be combined with the last equation in 
equation set (16) to get 


>‘ai3 = Di*‘Fi3+di (13) 

where 

8 i 3 = acceleration of the ith univ^^ joint. 

Fj 3 = force transmitted across the ith universal joint. 

The development of the above equations for the 
parallel mechanism under ccxisideratitMi here are given 
in Appendix A. 

The acceleration of the joint and the center of mass of 
the top member are relmed by: 


*Ti3-*Rl, 


0 0 0 
0 0 0 
0 0 Jj2z. 


f*R,‘a 


(17) 


The equation of motimi of the top memb^ und» the 
influence of forces Fy are given by (12). Equation 
(IS) can be used to eliminate forces Fi3 from 
equation (12) and equation (17) can be used to 
eliminate To from equation (2). This leads to: 
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(18) 


I qX‘Ri Dr"‘R, ■ 

i»l,6 I a 

1+ _Sj‘Rl.Ji2z*'Rt -qX*Ri Dr“‘RtqX) ra 


M+ X *RiDr‘*‘R, 

i=l,6 ‘ 

X qX*R, Dr"‘Rt 

. i=l,6 

X (‘RiDr‘di-‘RiDr“‘R,t)+M‘g 

X (qX*RiD7*d-qX‘R,Dr"‘RA*(0Xj‘(0 
> 1 . 6 ' ' ‘ . 

These equations can be used to solve for the 
accelerations of the top member. These are a very 
efficient form of the dynamic equations. 

The above accelerations are in the coordinate system 
fixed to the top member. They can be nansformed to 
the acceltf ations in the fixed coordinate frame by a 
simple coordinate transformation. Integrating the 
accelerations will give the velocity of the top 
member. The angular velocity of the top member can 
be used to calculate the euler angle rates using the 
following equation: 


d' 


Cosy 

Siny 

0 

P 

= SecP 

-Siny CosP 

Cosy CosP 

0 

.7. 


Cosy SinP 

Siny SinP 

CosP 


The above rigid body model was combined with the 
dynamic models for the hydraulic cylinder and the 
servolvalves. The details of these models are given in 
Appendix B. 

Discussion of Results 

The results from the simulation are shown in Figures 
(S)-(IO). The top member was moved along the z- 
axis from a initial posititm of (0,0,140.0) to a final 
position of (0,0,140.2). while keqting the mientation 
fixed. The move was commanded at 0.1s from the 
start of the simulation to allow for the system to 
attain equilibrium. The gains for the controller were 
obtained by modifying the gains obtained from a 
linearized analysis. 

The results from the rate based scheme are shown in 
Figures (S)-(7). The final position is obtained in 
0.122s. The move in the z-direction induces 
oscillation in the x and y directions. If the gains are 
increased any further, the system will become 
unstable. 

The results from the force based scheme are shown in 
Figures (8)-(10). The final z-position is reached in 
0.058s. This is a 53% improvement in the step 
response of the mechanism in the z-direction. The 
gains in this system can be individually adjusted for 


each direction and thus the response in the x, y and z 
direction can be individually tailored. There is also a 
significant steady state error in the response. This is 
an expected error due to the weight of the top member 
and the upper part of the limbs. This error can be 
removed by adding a feedforward term to the 
controller. The mechanism in the beginning of the 
move can estimate the weight of the top member and 
then use this information in the feedforward term. 

Conclusions 

A simple constant gain f<m:e control scheme has been 
devised fw the parallel mechanisms. This scheme is 
easy to implement and is better suited to parallel 
mechanisms than rate control schemes. The force 
ctxitrol scheme requires the computation dT H'^ while 
the rate control scheme uses the inverse position 
kinematics. The H matrix can be inverted 
symbolically to save on computational expose. 

The preliminary results indicate that the force based 
control scheme will improve the response over the 
rate based control scheme. Further worir, howev^ is 
required to reach any comprehensive conclusions. 
There are important unanswered questions about the 
effect of the position of the end-effector and the 
payload mass on the response of the mechanism. 

This control scheme can be easily adapted to control 
forces on the end effector. This capability will be 
crucial in applications of parallel mechanisms such as 
assembly tasks or partial/zero gravity simulators. 
The force control capabilities of the above control 
scheme also needs further investigation. 
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Appendix A 


r s position of the origin of the top coordinate 
system in the fixed coordinate system. 



Figure al. Free body diagram of the lower part of 
limb. 


Fi3z 



Figure a2. Free body diagram of the upp^ part of the 
limb. 

Given the position of the coordinate system attached 
to the top member and its orientation, the position 
vector of the ith joint on the top member, in the fixed 
coordinate system can be computed by 


where 



Cosy CosP 
Stay CosP 
-SinP 


-Siny Cosa + Cosy Sinp Sina 
Cosy Cosa + Siny Sinp Sina 
CosP Sina 


Siny Sina + Cosy Sinp Cosa ' 

-Cosy Sina + Siny Sinp Cosa 
CosP Cosa 

*qi = position of ith joint in the top coordinate 
system. 


The vectors along the limbs in the fixed coordinate 
system are given by: 

li=Si-pi 

The limb lengths are the magnitude of this vector. 



Fmr each limb, a coordinate system attached to the 
limb is setup. The limbs will be assumed to be 
axisymmetric. The z-axis for the new limb 
coordinate system is along the limb. The x-axis is 
perpendicular to the z-axis of the limb coordinate 
system and the x-axis of the fixed coordinate system. 
Thus the rotation matrix is given by: 



0 


liz 

1 

Vl-lbr 

lixliz 

liy 

1 

1 


where 

iix> iiy. liz = components of the vector li. 

The velocity of the ith joint is given by: 

*Sj=*v+‘a)x‘qi 

where 

^v = velocity of the center of mass (coincident with 
the origin) of the top member. 

‘to = angular velocity of the cent»’ of mass of the top 
member. 

The velocity of the ith joint in the limb coordinate 
system is: 


where 

In the following section, all the variables are in the 
limb coordinate system. The superscript is avoided 
for clarity. 
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The angular velocity of the ith limb in the limb 
coordinate system is given by: 


Six 

fi-J. = 1&. 

^ly - j 

The velocity of the prismatic joint is given by: 
i = Siz 

The required equations of motion of the bottom link 
are: 


”^i2x ~ (^ilx ***il^il 1 )®ix "*■ i2y ■*" l®il8iy 
Ti2y = (^ily ®*il^ill)®iy ■*‘^il^i2x 


where 


Jilx = mass moment of inertia of the lower limb 
about the x-axis. 

Jily = mass moment of inertia of the lower limb 
about the y-axis. 


Six 

8iy 

=‘‘Rb 

'O' 

0 

_8iz. 




cocvdinate system. 


gravity vector in the limb 


g = acceloation due to gravity (9.81 m/s^). 


«iy = CillFiSx + Cil2 
«ix = Ci2lPi3y + ^i2i 
ij =Cj3iFi3x +Ci32 

where 
Cm = 


(a3) 


li 


Ci21 = “ 


Jily + Ji2y + + mi2(li “ *>i22) 

li 


Ci22=- 


Jilx + Ji2x + ™ill>m + ™i2(li “ ^i2l) 

^ [mjibiii + mi2(lj - bj22)]gix 

Jily + Ji2y + n*ill>?ll + ™i2(li “ *>122)^ 

2mi2iiC0iy(lj - bj22) 

Jily + Ji2y + ™ilb?ll + ™i2(li “ l>i22)^ 

[miibjn+mi2(li-bi22)]giy 

Jilx + Ji2x + ™ilbni + mi 2 (li - bj22) 

2mi2ii(0ix(li - bj22) 

Jilx + Ji2x + ™ilbm + nii 2 (li - bi22)^ 

1 

Ci3i=-— 

™i2 

Fj, + mi 2 gix + nii 2 {li - bi 22 )(to^ + w?y) 

Ci32 = 

mi2 

Fia = force fnxn the actuator 


The equation of motion of the top limb are given by: 

™i2[(li “ bi22)ciiy + 2ii(0iy ] = Fi3x - Fj2x + n»i2gix 
***i2[~(li “ bj22)ctix “ 2ijC0jjj j = Fj3y — Fj2y 
+n>i2giy 

*^i 2 ['li “ (li ~ bj 22 )(t 0 ix + CJjy jj = Fj 32 — Fj 2 x 

"*’***i2giz 

Ji2x«ix =-bi22Fi3y -bi2iFi2y -Ti2x 

Ji2yOiy = bi22Fj3x + bj2lFi2x “ Ti2y 

Ji2z®iz ~ 'Fj3]j (a2) 


where 

C(ix> cciy> otiz = angular acceleration of the limb in 

the limb coordinate system. 

mi2 = mass of the upp^ part of ith limb. 

Ji2x. Ji2y. Ji2z = ”>ass moment of inertia of the 
upper part of ith limb about its center of mass. 


The angular accelo:ation in equation (a3) are related to 
the acceleration of the top joint by the equation: 


-ai3y - 2iit0ij 
^iSX “ 2iit0iy 

“iy= i 

*i 

Combining equations (a3) and (a4): 

ai3 = DiFi3 + di 


where 


*i3 ~ [®i3x ®i3y ®i3z] 

Fi3 = [Pi3x Fi3y Fi3j] 



0 0 
-Ci2ili 0 

0 Cj3i 


(a4) 

(a5) 


CcHnbining equations (al) and (a2): 


(b5) 


di = 


Cil2li+2iWiy 
-Ci22li - 2i(0ix 
Ci32-l(“u+“?y) 


This matrices Di and di in this equation can be 
computed fcM- each limb. 

Appendix B 

The model for the two stage servovalves for small 
displacements is given by the block diagram in Fig. 
1 . 



Figure bl. Block Diagram of Flow Control hydraulic 
servovalve ([8]). 

The flow through an orifice is given by the equation: 
Q = *^VPi “P2 (bl) 


where 

Q = volume flow rate. 

A = area of the oriflce. 

PI = pressure on one side of the orifice. 

P2 = pressure on the other side of the orifice. 

K = constant. 

For a positive displacement of the servovalve xj, the 
supply pressure is connected to chamber 2 of the 
cylinder and the return side is connected to the 
chamber 1. The equations are given by: 

Qin = KsXf VP|-P2 + Ki (pi - P 2 ) (b2) 

Qout ~ ^»*»VP1 ~~ Pr ~~ Kl(Pl “ P 2 ) (1*3) 

These flow rates are equal to the corresponding 
volume changes in the chambers. These change are 
given by: 


Qout 


. Ai(bi + b2 -1) dp\ 

‘ P dt 


Using the above set of equations: 

A2(l-b2) dp; ^ k,|x,|Vp, -P2 -IA 2 
+Ki(pi-P2) 


Ai(bi+b2~l) dpi 
p dt 


-K,|x,|Vpi-p^+iAi 


-Ki(pi-P2) 

If xs is negative, then these equations are: 


(b6) 

(b7) 


A2(1~1>2) dp2 
p dt 


= -K,|x,|Vp2-pr -1A2 


+l^l(Pl“P2) 
Ai(bi+b2~l) dpi 
p dt 


= K,|x,|^p,-pi +iAi 


~Ki(P1“P2) 


(b8) 

(b9) 


The above set of diffo-ential equations can be solved 
to obtain the pressure in the cylinder chamb^. The 
force on the limb is given by: 

Fi« = P2A2-PiAi-ci (blO) 

where 

c = damping coefficient. 

This completes the development of the equations for 
the hydraulic cylinder. 


Qin 


= iA2 + 


A2(l-b2) dp2 
p dt 


(b4) 
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Abstract 

A method of dealing with singularities and joint 
limits in the inverse kinematics for both redundant and 
non-redundant serial-link manipulators is presented. 
The method uses damped least squares with dynamic 
weighting for the approximate solution of the inverse 
Jacobian problem. Damped least squares has become a 
popular approach for dealing with singularities. The 
method presented extends the utility of damped least 
squares by incorporating dynamic weighting matrices 
within its formulation. This allows specific joints to be 
targeted in the minimization of the joint differential 
vector. An efficient algorithm is given for the solution 
of the weighted damped least squares problem. This 
algorithm is implemented, along with an algorithm to set 
the weights, for a six d.o.f. telemanipulator slave. A 
solution that is approximate in the task space and that is 
physically realizable in the joint space is obtained at or 
near singularities and/or joint limits. Away from 
singularities and joint limits an exact solution is 
obtained. The results are a well behaved slave 
manipulator under teleoperational control even when the 
slave is at or near singularities and/or when unreachable 
configurations are commanded. 

1 Introduction 

The inverse kinematics problem can be stated as 
follows: given a desired position and orientation of the 
end effector of a manipulator find a joint configuration 
that satisfies it. This problem is central in the control of 
robot manipulators. Any time the motions of a 
manipulator are described in a general space such as a 
Cartesian space, the inverse kinematics must be solved. 
In order to avoid this, manipulators are often controlled 
by describing the motions only in the joint space. This is 
done, howevCT, with a great loss of generality. For 
serial-link manipulators the inverse kinematics problem 
is complicated by non-linearities, singularities, 
unreachable configurations, multiple solutions, and even 
infinite solutions in the case of a redundant manipulator. 
The nonlinearities can be avoided by calculating the 
inverse kinematics iteratively using the Jacobian of the 


manipulator. Redundancies, while complicating the 
solutions, are actually utilized to satisfy some criteria 
that is secondary to the motion of the end effector. This 
is a large body of research. The focus of this paper is in 
dealing with singularities and unreachable 
configurations. 

The methods discussed here utilize the damped 
least squares inverse of the Jacobian, which has been 
proposed by many researchers for the inverse kinematics 
problem.*-^’^-® This inverse has the benefit of being 
compatible with solutions based on the pseudoinverse 
which has become a very popular method of calculating 
the invCTse Jacobian. The pseudoinverse has become 
popular for many reasons including the utilization of 
redundancy.*-’-*’® The use of damped least squares 
results in an approximate solution with a decrease in the 
size of the solution vector. This is beneficial in 
controlling the large joint rates resulting from exact 
solutions near singularities. The addition of dynamic 
weighting matrices in the damped least squares solution 
is proposed in this paper, to increase its utility. Using 
weighting matrices the damped least squares solution is 
extended to methods of dealing with unreachable 
configurations caused by joint limits. The previous 
research with damped least squares has dealt mainly 
with singularities. The dynamic weights can also be 
used to target the problem joints, of a particular 
singularity in reducing the size of the joint space solution 
vector. 

For any serial-link manipulator a particular 
configuration of the joints corresponds to a unique 
position and orientation of the end effector in Cartesian 
space. This relationship is described by the forward 
kinematics function of the manipulator. The methods 
used to develop this function are well established. The 
position and orientation, or the task space variable, 
X e R® (generally m=6), of the end effector is described 
as a function of the joint space variable, q e R”, by the 
nonlinear forward kinematics equation, 

x = A(q) (1) 


Copyright <B 1994 by the Azneriom faistitute of Aeranmutia 
and Aatronautica, Inc, All righa zeterved 
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The differential relationship of x and q is described 
using the following linear equation: 

5x = J(q)6q where J(q)g (2) 

dq/dt 

In equation (2) J(q) is the m x n manipultor Jacobian 
matrix. General methods for the development of the 
forward kinematics function and the Jacobian of a 
manipulator can be found in any introductory text on 
robotics such as Craig*®, Paul**, or Koivo*^. (Note: 
Hoeafter the functional dependence of J on q will be 
dropped and assumed to be understood.) 

The inverse kinematics problem almost always 
reduces to that of solving equation (1) for q or equation 
(2) for Sq in an iterative scheme to find q. Analytical 
solutions of equation (1) are known only for a few simple 
non-redundant (m=n) manipulator geometries. The six 
degree of freedom (m=n=6) manipulator geometries for 
which these solutions exist were clarified by Pieper*^. 
Iterative inverse kinematics schemes can be used by 
solving equation (2). This can be done off line or it can 
be done on line within the control system of the 
mmiipulator, using each iteration to calculate the joint 
control law. An example of on-line iterative inverse 
kinematics is resolved rate control.*^ Nearly all of the 
contemporary research dealing with inverse kinematics 
solutions is devoted to finding a solution or an 
approximate solution to equation (2). This is true for 
three reasons: 1) there are many manipulators for which 
an analytical solution to (1) does not exist; 2) the 
nonlinearities of equation (1) impede the development of 
general methods for a numerical solution procedure; 3) 
general methods for dealing with the other complications 
of inverse kinematics can be incorporated in the solution 
of (2). Solutions to equation (2) are commonly found 
by solving a linear system of equations for 5q using some 
well established method, such as Gaussian elimination. 
In the non-redundant, exact case these solutions are 
described using the equation 

Sq = r^5x (3) 

This equation may be generalized to include redundant 
manipulators and/or approximate solutions using the 
following equation: 

6q = J*8x (4) 


In this equation J* is a some type of inverse of the 
Jacobian matrix. If m=n then J* is likely to be 
whereas fix’ redundant manipulators (n>m) J* might be 
the Moore-Penrose pseudoinverse*®, J* = ( JJ^ )“^ . In 

the redundant case with the pseudoinverse, (4) is a 
particular solution, the minimum norm solution, of the 
general solution given by 

8q = J*8x+(I-J*J)v (5) 

Here, v e R" is an arbitrary joint space vector used to 
satisfy some criterion such as obstacle avoidance. The 
null space vector , v, is projected into the null space of 
the Jacobian, taking advantage of the redundancy. 
Therefore, the solution given by (5) still satisfies 
8x = JSq . It might be noted that in the non-redundant 
case where the second term of equation (5) 

vanishes and the null space vector has no effect 

2 Singularities and Workspace Boundaries 

The solutions to the inverse kinematics problem 
given in the previous section work well for control of a 
manipulator when it is not near a singularity or 
workspace boundary. However, near a singularity or 
workspace boundary certain components of commanded 
movements either require large joint rates or are 
physically impossible to satisfy. Therefore a robust 
algorithm for the calculation of the inverse kinematics of 
a manipulator must deal with singularities and 
workspace boundaries. 

Physically, a singularity may be described using end 
effector motions (or forces) in the task space. In a 
singular configuration a manipulator is degenerate, 
causing the end effector to loose degrees of freedom in 
the task space. This means that the robot cannot move 
(control forces) in certain directions or that motion 
(force) in some direction is dependent upon motion 
(force) in others. Near a singularity small motions (large 
forces) in certain directions require large joint rates 
(small joint torques). 

Mathematically a singularity may be described using 
the Jacobian matrix. The Jacobian is rank deficient 
(rank(J)<m) when the manipulator is in a singular 
configuration. In the case of m=n the determinate of the 
Jacobian is zero. Near a singularity the Jacobian 
becomes ill-conditioned and elements of the inverse or 
pseudoinverse are large. If the condition number of the 


862 


Jacobian becomes too large then a general solution 
attempting to solve equation (2) will have numerical 
problems. 

A significant amount of research is devoted to 
developing inverse kinematics with singularity 

robusmess. Methods that utilize redundancy to avoid 
singularities have been proposed by many 

researchers.*^*"^'**’'® HbwevCT none of these methods 
ensure both a nonsingular Jacobian and non-cyclic 
behavior. Also, they need the null space vector which 
might be used for other purposes. Whimey*^ proposed 
removing the under generating block of the Jacobian and 
then using a pseudoinverse to calculate an approximate 
solution near singularities. However this requires a 
different Jacobian for each singularity. Futhermore, 
achieving continuity when switching solutions is difficult 
with this method. The most appealing of the {xoposed 
solutions are those that use damped least squares.*'^'^'^ 

Workspace boundaries are the boundaries between 
that space which is reachable by the manipulator 
(W c R®) and that space which is not (W). These 
boundaries occur in configurations where the 
manipulator is at a singularity(s) or at a joint limit(s). 
Because of joint limits an algorithm that deals with all oi 
the singularities of a manipulator does not necessarily 
deal with all of the workspace boundaries. 

Few solutions to the joint limit problem have been 
given in the literature. Some propose the use of 
redundancy to avoid joint limits.® The methods proposed 
do not ensure their avoidance and are not applicable in 
cases where redundancy is not available. In practice this 
problem has been dealt with at the global level of path 
planning, searching the entire path for unreachable 
configurations. However, this is not always possible 
such as when the manipulator is operated 
teleoperationally with a human giving real time 
commands. 

3 Inverse Kinematics Using Damp«»d T.gast Squares 

The damped least squares solutions to the inverse 
kinematics problem are intended to ensure a well 
conditioned matrix for the solution algorithm while 
limiting the size of the solution vector, 5q. This is done 
by adding a diagonal matrix, ol, to the matrix JJ^. 
Damped least squares may be used when an approximate 
solution to equation (2) is necessary or acceptable. 


K Sqis found using the equation 8q = J*8x, wh«e 
J* is the pseudoinverse. Then the solution,8q, satisfies 

minll6ql|^ (6) 

Sq 

among all 5q satisfying 

min||8x-J6qp (7) 

6q 

where I'll denotes the Euclidean norm. If the Jacobian is 
of full rank then satisfying the constraint (7) is the same 
as satisfying equation (2). 

If the approximate solution of damped least squares, 
8q = J'‘‘8x where 

=J'^(JJ'^+aI^^ a>0, (8) 

is used then the solution satisfies 

min{|8x-J8qf +a^||8qlp} (9) 

6q 

Note that for a=0 the damped least squares solution is 
the pseudoinverse solution. 

In the damped least squares case the size of error in 
the task space is weighed against the size of the resulting 
solution. For a given 8x the size of the solution vector is 
decreased by increasing a. However, this is done at the 
expense of using an approximate solution. As CL 
increases so does the size of the error in the task space. 
A large a has the other benefit of ensuring a well 
conditioned matrix for inversion. It has been shown by 
Mayorga et al.^ that the condition number, K, of the 
matrix P = ( + al) , is 

K = (o?+a)/(a^+a) (10) 

where aj > C 2 ^...S ^ 0 are the singular values of 

the Jacobian matrix. It can be seen in this equation that 
K is made arbitrarily close to 1 by increasing a, thus 
ensuring a well conditioned matrix for inversion even in 
a singular configuration where o,„=0. 

Simply stated, if one is willing to give up the 
exactness of the solution then the size of the joint rates 
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can be reduced and a well conditioned matrix for 
inversion can be ensured by increasing a. It should be 
pointed out that an exact solution may not be physically 
obtainable or even desirable near a singularity. This is 
due to the loss of degrees of freedom in the task space or 
the necessity of large joint rates, which may be 
unachievable or beyond safety limits. 


The weighted damped least squares solution is 
intended to increase the utility of the damped least 
squares solution by providing the ability to significantly 
affect the size of individual components of mimimized 
vectors. This is done by adding weighting matrices to 
the formulation of the damped least squares problem. 


imnjfW, (5x- J8q)f +a^ [w’^Sq 


Taking the diagonal structure of the weighting 
matrices into account, it can be seen in the minimization 
condition (16) that the relative importance of the 
individual components of the task space error vector and 
of the solution vector are controlled by the individual 
elements of and Wq respectively. If a>0 then 
increasing the size of w^. decreases the size of 
(Sx- J5q)i and decreasing the size of Wq decreases the 
size of Sq^. The strict inequalities in (12) may be 
relaxed for a>0 allowing Wq^=0 for some i. This may 
be desired if it is necessary to eliminate the use of some 
Sqii from the solution, such as at a joint limit. 


The importance of individual components in the 
minimization condition of the damped least squares 
solution can be adjusted by scaling the rows and columns 
of the Jacobian before the damped least squares solution 
is calculated. Consider the following reformulation of 
equation (2); 

W,8x = W^JWqW-i8q (11) 

where the weighting matrices are defined by 
Wjsdiag[wj ...w ], Wj >0 

( 12 ) 

Wq 3diag[Wqj ...Wq__ ], Wq, >0 

If the following definitions are made 

6x„^W,8x, 6q,^Wq-^8q, 

Jw-W,JWq, p,=j„jj;+ca 

then (1 1) can be written 


5 x^ — Jw^\ 


In solving the inverse kinematics it is desirable to 
avoid the explicit inversion of a matrix since this is 
computationally expensive. A more efficient algorithm 
will solve a linear system of equations using Guassian 
elimination or some similar method. It is also desirable 
to minimize the number of matrix-matrix multiplies and 
to factor the matrix-matrix and matrix-vector 
multiplication. The following reformulation of the 
problem, will aid in the understanding of the solution 
algorithm presented here. 

8q =Wqj;8x^ 

= WqjT(J^jT+aI)-i 8 x^ 

= W J^P~^8x ^ ^ 

= WqjTy 

where y = Pw^8x^ . 

An efficient solution algorithm fen* inverse 
kinematics problem using weighted damped least 
squares is summarized in the following five steps: 


Solving (14) using damped least squares 
(8qw =Jw(JwJw +“!)”' 8x^ =J^8x^) results in an 
approximate solution to equation (2), given by 

8q = Wq8qw, = WqJ^8xw (15) 

This solution satisfies 


1. calculate or set J,8x,Wx, Wq, and a 

2. form = Wj8x and = W^JW^ 

3. form P(y = JwJw + “I 

4. solve 8x^ = P^y for y using Cholesky 
decompostion 

5. form 8q = Wq J^y 
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It is not claimed that this is the computationally optinud 
algorithm for solving this problem. An optimal 
algorithm is both application and hardware dependent. 
However, this algorithm is fairly efficient if the 
implementation takes advantage of the structures of the 
matrices and vectors involved. Most important are the 
diagonal structure of and Wq , and the symmetry of 
Additionally, is positive definite if: 1) 

rank(J^) = m and a^O, or 2) a>0. can (Mily 
be rank deficient if J is rank deficient or some weight is 
set to zero. In either of these cases, a will be 
significantly larger than zero if it is set correctly. 
Therefore, Cholesky decomposition can be used for the 
solution of the linear system in step 4. One might note 
that the two conditions given above, are actually the 
conditions that ensure is nonsingular, which is 
important in any solution scheme. 

The algorithm above does not include the utilization 
of a null space vector for self motion of redundant 
manipulato rs. A algorithm similar to the one above 
which utilizes the a null space vector requires a 
reformulation of the general solution to equation (2). 
Such a reformulation follows. 

6q = Wq{j;5x^+(I-jX)Vwl 

= Wq + (I - JWq ) Wq“V) 

= Wq (5X^ - JV) + Wq"‘ V) (18) 

= WqJ^P"V,(8x-Jv) + V 

= WqJ^y + v 

where y = P^^W^ (6x- Jv) . 

An algorithm similar to the one above which 
implements the general solution is summarized by the 
following five stq)s. 

1. calculate or set J,8x,v,Wx, Wq, and a 

2. form z = Wjj (Sx- Jv) and = WxJA^ 

3. form P^ = + oI 

4. solve z = P^y for y using Cholesky 
decomposition 

5. form 5q = Wq J^y + v 


6 Setting the Weighting Matrices 
and the Damping Factor 

6.1 General DKCu.s.sion of the Weighting Matrices 
and the Damping Factor 

Developing an algorithm to set the weights and 
damping factor is not a trivial task. There are several 
requirements of this algorithm. The basic idea is to 
dynamically adjust the weights and damping factor so 
that: 1) an exact solution is obtained away fix>m 

singularities and joint limits; 2) an approximate 
solution, that is near the desired solution in the task 
space and is physically realizable in the joint space, is 
obtained near singularities and joint limits; 3) the 
transitions between exact and approximate solutions and 
between different approximate solutions are smooth. 

A non-zero damping factor, a, is used to ensure a 
well conditioned matrix and to include the solution 
vector in the minimized quantity. However, fix a>0 the 
solution is an approximate one. Therefore, away from 
singularities and joint limits, a should be zero. As the 
manipulator ^proaches a singularity or joint limit a 
should be increased in a manner that: 1) ensures a stable 
numerical solution; 2) keeps the joint differentials 
within safe limits. A value of a that satisfies the second 
condition will most probably satisfy the first. It should 
be noted that the first condition is only a concern near 
singularities or when there is a very large relative 
diffo^ence in the size of the individual weights that are 
used. It should also be noted that satisfying the second 
condition is dependent upon the method used to set the 
joint weights. 

The relative sizes of the joint weights are used to 
control the importance of each of the joints in the 
minimization. If a joint has a small weight and a is 
non-zero, then the approximate solution tends not to use 
that joint. For example, in a region near a singularity 
where the rate for joint i tends to be large, Wq. should be 
small. The weight might also be small near a limit for 
joint i. However, setting a joint weight based solely on 
the distance irom the joint limit poses a problem in itself. 
If this is done, the joint will tend to stay at the limit even 
if the exact solution gives a 5qj which is away from the 
limit. Therefore it is necessary to include another 
criterion such as the direction of 5qj in the previous 
iteration of the inverse kinematics. 
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The relative sizes of the error vector weights can be 
used to control the importance of individual components 
of the task space error vector. Increasing w^. decreases 
the size of (5x-J5q)i. While there are certainly 
situations in which it would be useful to dynamically set 
W,, what is suggested here is use of a constant to 
normalize the Jacobian matrix. For m=6 the Jacobian 
contains three rows related to position and three rows 
related to orientation. The units in which position and 
orientation are measured create large differences in the 
relative magnitude of the elements of 5x. A constant 
Wx can be used to balance the importance of position 
and orientation. 

6.2 Previous Work Dealing With Setting the 

Damping Factor 

Several schemes have been proposed for the 
computation of the damping factor in damped least 
squares without dynamic weighting. Most of these 
schemes are aimed at providing singularity robustness 
and do not include provisions for joint limits. Nakamura 
and Hanafusa^ proposed that the damping factor be 
calculated as follows: 

„^[ao(l-w/wo)^ ifw<wo 

[ 0 ifw>wo ^ ^ 


(JJ^+oI) = LDL^. It is shown that the parameters ^ , 
dg, and d, can be calculated inexpensively as by- 
products of Guassian elimination in the solution of the 
damped least squares problem. A method was suggested 
by Chan and Lawrence^ for the calculation of the 
damping factor for both singularity and workspace 
boundary robustness. They give the equation 

a = ao||8xef (21) 

where is the error of the manipulator in the task 
space and ao is a constant. None of the methods given 
in first three references'*^ provide for a non-zero 
damping factor near joint limits. This is because they 
are established only for singularity robustness. The last 
scheme"^, which does include provisions for joint limits, 
has questionable performance in providing a well 
conditioned matrix near singularities. Also, its 
performance with load generated task space errors is 
questionable. All of these methods are intended to be 
used throughout the workspace of the manipulator. 
While it is theoretically justified to find a general 
method to handle all cases, perhaps a more effective 
solution would be to find measures which target specific 
singularities and joint limits. Such measures could also 
be used in the setting of the joint weights to specifically 
target the problem joint(s) for that singularity. 


where w = ^det(JJ^) is Yoshikawa’s manipulability 
measure^ and ao and wo are constants to be determined 
experimentally. A modification to this scheme is given 
Kelmar and Kholsa^. They suggest a method of 
calculating the damping factor using the manipulability 
measures at the ith and (i+l)th iterations. Mayorga et 
al.^ proposed a scheme which establishes an upper 
bound, for ihe condition number of the matrix 
(JJ^ +al). In their scheme a is calculated for the next 
iteration using parameters calculated in the present 
iteration and the three constants ao, ^o» The 

method is described as follows: 


ai+i = 


ao(l-£/eo)^ 

0 


ife>eo 

ife^eo 


( 20 ) 


where £ = tn^^dg / dj . Here C is O'® upper bound of the 
infinity norm ||LL, dg and djare the greatest and 
smallest elements, respectively, of the diagonal matrix 
D, and L is the unit triangular matrix such that 


63 Scheme for Setting the Weighting Matrices and 
the Damping Factor 

A scheme for setting the damping factor and joint 
weights is presented here. It considers both singularities 
and joint limits. The equations for this scheme, (22) 
through (29), are given below. In each iteration of the 
kinematic calculations, it determines a damping factor 
and a set of joint weights for each singularity. It also 
determines a damping factor and a joint weight for each 
pair of joint limits. However, only the maximum 
damping factor, and the minimum weight for each joint, 
are used in the inverse kinematics solution (equations 
(28) and (29)). 

The damping factor ranges in value from zero, when 
the manipulator is not in the region of a singularity or 
limit, to tto , when the manipulator is at a singularity or 
limit. The joint weights range in value from one, away 
from singularities and limits, to WqOs s' 3 singularity, 
and to Wq 0 { at a joint limit. Different minimum values 
fw the joint weights are used at singularities and limits 
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because at a limit it is desired to completely eliminate 
the joint from the solution, while at singularity it might 
only be desired to control the rate of the joint. To 
prevent sticking at the limit, zero is not used. In practice 
a small joint differential obtained using a small joint 
weight, rather than zero, can be discarded. 

For the singularities the damping factor and joint 
weights are set using a measure of distance from the 
singularity (equations (22) and (23)). A measure at 
distance, either a linear distance or an angle, must be 
identified for each singularity that is reachable by the 
manipulator. The joints that need additional damping 
also need to be identified for each singularity. If it is not 
possible to make these identifications, then (22) might be 
rqilaced by a scheme similar to those given in (19) or 
(20). However, the benefits of dynamic joint weighting 
are lost, for singularities, if these schemes are used. 


Wqs„ ,w,^ joint weight of joint i as calculated for 

singularity j and for joint limit i 

dj^ ,d(^ "distance" from singularity j and from 

joint limit i 

u^. temporal ramp for joint limit i 

8qjoid differential for joint i from previous 

iteration 


Jao(l-(ds /dos,)^) 

I (j ifd,.^do,. 


(22) 


if d,. >dos, origAj (23) 

where W(ds.) = w^q, +(1-Wq0s)dsj / do,. 


Jhilimj-qi if8qiou >0 
Iqj-lolimj if6qioid<0 


Because a joint limit is a one sided problem, 
temporal ramps are used in setting the damping factor 
and weights, when considering joint limits (equations 
(26) and (27)). These ramps are calculated using: one as 
an upper bound; a dynamic lower bound found using the 
distance from the limit the joint is moving towards; and 
a discrete step (equations (24) and (25)). If a joint is in a 
joint limit region and moving towards the limit, then its 
weight is ramped down to a value that is found using the 
distance from limit. If it is moving away from the limit, 
then its weight is ramped back up to 1. 

The following nomenclature and equations are 
defined for the algorithm used to set the damping factor 
and joint weights. 

Constants 

tto upper limit of damping factor 

Wqo,,WqO{ lower limits of joint weights for 

singularities and joint limits 

dos. .doe ®dge the region for singularity j and 

edge of region for joint limits 

8u step to increase or decrease the joint 

limit ramps in successive iterations 
hilimj .lolimj ....high and low limit for joint i 

Aj set of joints for increased damping near 

singularity j 

Variables 

a*. ,a{^ damping factor as calculated for 

singularity j and for joint limit i 




fmax{uf. -5u, dj, /dojl if dj. <do( 
1 min{u^. +5u, 1} if dj. >doj 

=Wq0|+(l-Wq0{)u,. 

=Oo(l-u,^) 

a = max|m^(as. ), max(a(, )| 

Wq. =min|min(w 


I 


nin(Wqs„ ), w<^. | 


(25) 

(26) 

(27) 

(28) 
(29) 


A constant is used to normalize the Jacobian 
matrix. The first three rows of the Jacobian are related 
to position, which is measured in linear units, and the 
last three are related to orientation, which is measured in 
radians. The elements of are set as follows: 




= NORM 
= 1 


(30) 


Here NORM s jt / (max reach of manipulator) . 

7 Discussion of an Implementation of 
Weighted Damped Least Squares 

Weighted damped least squares, along with the 
scheme for setting the weights and damping factor, is 
used in the control of a slave manipulator in a 
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teleoperational master and slave system. The controller 
for the slave, which runs on a 33 MHz 486 PC/AT at 
about 300 Hz, includes: analog and digital interfaces, 
joint servos, inverse kinematics, and communications 
with the master controller. The telerobodc manipulators 
used in the system are the Kraft master and slave. 
These manipulators have six degrees of freedom and are 
kinematically similar. In the particular control system 
described however, they are not controlled using a joint 
space mapping between the master and slave, which is 
normally true of kinematically similar master and slave 
systems. Instead, a Cartesian space mapping, with 
scaling and indexing, is used. Therefore, the master can 
command unreachable conHgurations to the slave. 
Furthermore, the master may not be near a singularity, 
and therefore not hindered in any direction, while the 
slave is operating near a singularity, and dierefore with 
limited capabilities in certain directions. Weighted 
damped least squares is used in dealing with these 
unreachable configurations and singularities, in real time 
within the local control loop of the slave. Exact 
solutions of equation (2) are found when the slave is 
away from joint limits and singularities, and 
approximate solutions are found when the slave is at or 
near a joint limit or singularity. 

This inverse kinematics algorithm performs quite 
well in the system described above. The human operator 
is free to move the master about without worrying about 
what will happen when the slave is given physically 
unrealizable commands. The approximate solutions are 
both smooth and stable when the manipulator contacts 
the environment and/or when the manipulator is in 
sevml joint limit and/or singularity regions. Operation 
near a singularity results in approximate solutions with 
damped motion for the joints which swing about 
dangerously if the exact solution is used. Operation near 
joint limits result in approximate solutions that do not 
use the limited joint The movements of the end effector, 
resulting from the approximate solutions, are intuitive to 
the operator. This is because the solutions are 
approximate in the task space. In contrast, solutions that 
are approximate in the joint space are not intuitive to the 
operator. Approximate joint space solutions result from 
an exact mathematical solution to equation (2) with 
partial implementation in joint space due to the physical 
limitations. 

If the parameters of the algorithm are tuned well, 
the transitions between exact and approximate solutions 


and between different approximate solutions are smooth. 
However, if small values are used for do$. and/or dg^ 
then the manipulator tends to "jerk” when transitioning 
from one solution to the next. If a large value is used for 
Wq 0 (, then the resulting mathematical solution uses the 
limited joint even at the limit, and the physical solution, 
which is a partial implementation of the mathematical 
solution, is not intuitive. However, if zero is used for 
Wqot then the joint differential does not reverse and 
allow the joint move away from the limit. It was found 
that a small value of Wq 0 { is sufficient to allow this to 
happen. It was also found experimentally that a small 
value fcv Oq was sufficient. Although it is not done 
here, a minimum value for Uq might be developed using 
some theoretical justification such as an upper bound on 
the condition number of ( JJ^ + txl ) at the singularities. 

8 Conclusion 

In this paper a general procedure for the calculation 
of inverse kinematics with singularity and joint limit 
robusmess has been presented. The procedure uses a 
damped least squares solution to solve the inverse 
kinematics iteratively and incorporates dynamic joint 
weighting for the reduction of specific joint differentials. 
The procedure gives an approximate solution at or near 
singularities and/or joint limits and an exact solution 
away from singularities and joint limits. An algorithm 
was given for the efficient calculation of the inverse 
kinematics using the procedure and a scheme for setting 
damping factor and joint weights was given. The 
algorithm and scheme were implemented for a six d.o.f. 
teleoperator and a well behaved slave manipulator 
resulted under teleoperational control. 
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Abstract 

This paper describes ControlShell, a next- generation 
CASE ^Jramework” for real-time system software de- 
velopment. ControlShelVs well-defined structure, graph- 
ical tools, and data management provide a unique 
component-based approach to real-time software gener- 
ation and management. ControlShell is designed specif- 
ically to enable modular design and implementation of 
real-time software. By defining a set of interface specifi- 
cations for inter-module interaction, ControlShell pro- 
vides a basis for real-time code development and ex- 
change. 

ControlShell includes many system-building tools, in- 
cluding a graphical data flow editor, a component data 
requirement editor, and a state-machine editor. It also 
includes a distributed data flow package, an execution 
configuration manager, a matrix package, and an object 
database and dynamic binding facility. ControlShell is 
being used in several applications, including the control 
of free-flying robots, underwater autonomous vehicles, 
and cooperating- arm robotic systems. 

This paper presents an overview of the ControlShell 
architecture, and details the functions of several of the 
tools. 

1 Introduction 

Motivation System programs for real-time command 
and control are, for the most part, custom software. 
Emerging operating systems [1, 2, 3, 4, 5] provide 
some basic building blocks — scheduling, communica- 
tion, etc. — but do not encourage or enable any struc- 
ture on the application software. Information binding 
and flow control, event responses, sampled-data inter- 
faces, network connectivity, user interfaces, etc. are all 
left to the programmer. As a result, each real-time sys- 
tem rapidly becomes a custom software implementation. 
With so many unique interfaces, even simple modules 
cannot be shared or reused. 
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An effective real-time framework must create a pro- 
gramming environment that facilitates sharing and reuse 
of real-time program modules. At a minimum, this re- 
quires providing interface specifications and data trans- 
fer mechanisms. The framework must also provide ser- 
vices and tools to combine modules and build systems 
from reusable components. Finally, the framework must 
meet the many challenges unique to real-time comput- 
ing. For example: 

• Real-time code must be able to react to external 
temporal events. 

• The real-time execution environment is fundamen- 
tally multi-threaded and asynchronous. 

• Real-time systems are usually composed of several 
different layers of control, each with different char- 
acteristics. For instance, strategic- level command 
and low-level servo control must be blended into a 
smoothly-operating system. 

• Real-time systems must handle changing condi- 
tions, often requiring switching between drastically 
different modes of operation. 

• Real-time systems are often physically distributed. 
In the simplest case, an operator control station 
may be remotely situated. More complex systems 
are comprised of many interacting distributed real- 
time and non- real-time subsystems. 

All these challenges must be efficiently and smoothly 
handled by the architecture. 

1.1 ControlShell’s Solutions 

Component-Based Design ControlShell is specifi- 
cally designed to address these issues. ControlShell pro- 
vides interface definitions and mechanisms for building 
real-time code modules. ControlShell also provides basic 
data structure specifications, and mechanisms for bind- 
ing data with routines and specifying data-flow require- 
ments. These two critical features make simple generic 
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packages (known as components) possible. Controls hell 
systems are built from combinations of these compo- 
nents. 

An extensive library of pre-defined components is pro- 
vided with the system, ranging from simple filters and 
controllers to complex trajectory generators and mo- 
tion planning modules. New or custom components are 
easily added to the system via the graphical Compo- 
neni Editor (CE). The Component Editor allows simple 
specification of data interchange requirements. Code is 
automatically generated to permit instancing the new 
component into the system. 

Graphical CASE System-Building Tools Con- 
trolShell also provides a set of powerful development 
tools for building complex systems. Building a system is 
accomplished by connecting components within a graph- 
ical Data Flow Editor (DFE). The data flow editor re- 
solves the system data dependencies and orders the com- 
ponent modules for most efficient execution. Radical 
mode changes are supported via a “configuration man- 
ager” that permits quick reconfiguration of large num- 
bers of active component routines. 

Real-time systems also require higher-level control 
functions. ControlShell’s event-driven finite state ma- 
chine (FSM) capability provides easy strategic control. 
The state machine model features rule-based transition 
conditions, true callable sub-chain hierarchies, task syn- 
chronization and event management. A graphical FSM 
editor facilitates building state programs. 

Real-Time System Services To provide support 
for real-time distributed systems, ControlShell includes 
a network connectivity package known as the Net- 
work Data Delivery Service (NDDS). NDDS provides 
distributed data flow. It naturally supports multiple 
anonymous data consumers and producers, arbitrary 
data types, and on-line reconfiguration and error recov- 
ery. 

ControlShell also offers a database facility, direct sup- 
port for sampled-data systems, a full matrix package, 
and an interactive menu system. Figure 1 presents 
an overview of the ControlShell toolset and design ap- 
proach. 

2 Relation to Other Research 

There are two quite different issues in real-time software 
system design: 

• Hierarchy (what is communicated) 

• Superstructure architecture (how it is communi- 
cated) 

Several efforts are underway to define hierarchy speci- 
fications; NASREM is a notable example [6]. Control- 
Shell makes no attempt to define hierarchical interfaces. 



Figure 1: ControlShell Design Process 

The ControlShell systein designer uses the many powerful 
tools j system ser vices ^ and prebuilt library components to 
construct a modular system. 

but rather strives to provide a sufficiently generic soft- 
ware platform to allow the exploration of these issues. 
As such, this work takes a first step — defining the ar- 
chitecture superstructure (control and data flow mech- 
anisms). 

Several distributed data-flow architectures have been 
developed, including CMU’s TCA/X [7, 8], Rice Uni- 
versity’s TelRIP [9], and Sparta’s ARTSE [10]. These 
provide various levels of network services, but do not 
address repetitive service issues or resolve multiple data- 
producer conflicts in a symmetric robust “stateless” 
architecture as does the ControlShell NDDS system 
(see [11] for details). Also, they are not integrated 
within a general programming system. 

Recently, more sophisticated programming environ- 
ments have begun to emerge. For example, ORCAD [12] 
and COTS [13] are specialized robotics programming 
environments. Two commercial products. System Build 
with Autocode from Integrated Systems, Inc. [14], and 
SIMULINK with C-Code Generation from the Math- 
Works, Inc. [15] are sophisticated control development 
environments. They offer easy-to-use rapid control sys- 
tem prototyping. They are not, however, architec- 
tures well suited to developing complex multi-layer dis- 
tributed control hierarchies. 

Implementation Experience ControlShell evolved 
from many years of research with real-time control sys- 
tems. It was first developed for use with a multiple- 
arm cooperative robot project at Stanford University’s 
Aerospace Robotics Laboratory [16, 17, 18]. From this 
start, ControlShell spread to become the basis for more 
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than 20 research projects in advanced control systems at 
Stanford. Among these were projects to study the con- 
trol of flexible structures, adaptive control, control of 
mobile robots (including multiple coordinated robots), 
and high-bandwidth force control [19, 20, 21, 22, 23, 24]. 
More recently, a few industrial sites and two NASA cen- 
ters have begun experimenting with ControlShell ap- 
plications [25, 26]. ControlShell is now being jointly 
developed by Stanford University and Real-Time Inno- 
vations, Inc. It is supported under ARPA’s Domain- 
Specific Software Architectures (DSSA) program. 

This continuous migration from specific, working ap- 
plications to wider spectrums of use is the key to usable 
generality. These applications continue to drive Control- 
Shell’s growth. To our knowledge, ControlShell is the 
only integrated framework package combining transpar- 
ent networking, component-based system description, a 
state machine model, and a run-time executive. 

3 Run-Time Structure 

Some of the major system modules are shown in Fig- 
ure 2. As shown in the figure, ControlShell is an open 
system, with application- accessible interfaces at each 
level. The figure is organized (loosely) into data and 
execution hierarchies. 

At the lowest layer, ControlShell executes within the 
VxWorks real-time operating system environment. The 
simple base class known as CSModules is the building 
block for most executable constructs. Organizations of 
these modules, into lists, menus, and finite state ma- 
chines form the core executable constructs. Users build 
useful execution-level atomic objects called components 
by defining derived classes from CSModules and bind- 
ing them through the on-line data base to data matrices 
from the CSMat package. High-level graphical editors 
speed component definition, data flow specification and 
state machine programming. Network connectivity is 
provided by NDDS for all application modules. 

4 Data Flow Design 

Many real-time systems contain sampled-data subsys- 
tems. Here, we define a “sampled-data” system as any 
system with a clearly periodic nature. Common exam- 
ples (each of which have been implemented under Con- 
trolShell) are digital control systems, real-time video im- 
age processing systems, and data acquisition systems. 
Each of these is characterized by a regular clock source. 

Providing an environment where sampled-data pro- 
gram components can be interchanged is challenging. 
These programs have routines that must be executed 
during the sampling process, routines to initialize data 
structures (or hardware) when sampling begins, and per- 
haps to clean up when sampling ends. Further, many 



Figure 2: Run-Time Structure 

ControlShell*s open architecture provides many powerful ser^ 
vices, while allowing application code free access to the un- 
derlying structures. 

routines are dependent on knowledge of the timing pa- 
rameters, etc. Although they may interact — say by pass- 
ing data — sampled-data program components are often 
relatively independent. Requiring the application code 
to call each module’s various routines directly destroys 
modularity. 

4.1 Components 

The component is the fundamental unit of reusable data- 
flow code in ControlShell. Components consist of one or 
more sample modules derived from CSModules. Sample 
modules have several pre-defined entry routines, includ- 
ing: 


Routine 

When executed 

execute 

Once each sample period 

stateUpdate 

After aU executes are done 

enable 

When this module is made active 

disable 

When it is removed from the active 
list 

startup 

When sampling begins 

shutdown 

When sampling ends 

timingChanged 

When the sample rate changes 

reset 

When the user types “reset”, or calls 
CSSampleReset 

terminate 

When the module is unloaded 

Thus, a motor 

‘ driver component might define 


startup routine to initialize the hardware, an execute 
routine to control the motor, and a shutdown routine 
to disable the motors if sampling is interrupted for any 
reason. In addition, if any of its parameters depend on 
the sampling rate, it may request notification via a tim- 
ingChanged method. By allowing components to attach 
easily to these critical times in the system, ControlShell 
defines an interface sufficient for installing (and there- 
fore sharing) generic sampled-data programs. 
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Building Components: The Component Editor 
An easy-to-use graphical tool called the component edi- 
tor (CE) assists the user in generating new components 
and specifying their data-flow interactions. The com- 
ponent editor defines all the input and output data re- 
quirements for the component, and creates a data type 
for the system to use when interacting with the com- 
ponent. The tool contains a code generator; it auto- 
matically generates a description of the component that 
the Data-Flow Editor can display (see below), and the 
code required to install instances of the component into 
ControlShell’s run-time environment. 

4.2 Execution Lists 

An execution list is simply a dynamically changeable, or- 
dered list of sample modules to be sequentially executed. 
The active set of modules on a list can be changed any- 
time. In fact, lists may drastically change their contents 
during system mode changes. 

Execution lists may be sorted to provide automated 
run-time execution scheduling to resolve data dependen- 
cies. More specifically, the modules are sorted so that 
data consumers are always preceded by the appropriate 
data producers (see Figure 3), The system uses the spec- 
ifications of the data flow requirements for each compo- 
nent to sort the dependencies and order the list. A side 
benefit of the sorting process is the error-checking that 
is performed to insure consistent data flow patterns. 


Sample 


1 


A2DRead 






positions, velocities 


D2AWrite 




y 


torques 


Figure 3; Dependency- Sorted List 

Dependency-sorted execution lists provide automatic run- 
time sorting by data dependencies. 


Sample Habitats ControlShell provides a named 
sampled-data environment, known as a sample habitat. 
A sample habitat encapsulates all the information and 
defines all the interfaces required for sampled-data pro- 
grams to co-exist. It also contains routines to control 
the sampling process. For example, a module installed 


into a sample habitat can query its clock source and 
sample rate, start and stop the sampling process, etc. 

Each sample habitat contains an independent task 
that executes the sample code. The task is clocked by 
the periodic source (such as a timer interrupt). Special 
components are provided to interface between habitats, 
allowing multi-rate controller designs. 

4.3 Building Systems: The DFE Editor 

Building systems of components is made simple by the 
graphical Data-Flow Editor (DFE). The DFE reads de- 
scription files produced by the component editor, and 
then allows the user to connect components in a friendly 
graphical environment. It allows specification of all the 
data connections in the system, as well as reference 
inputs — gains, configuration constants and other param- 
eters to the individual components. An example session 
is depicted in Figure 4. 



Figure 4: Data-Flow Editor 

The data-flow editor builds collections of components into 
an executing system. 


5 Configuration Management 

Complex real-time systems often have to operate under 
many different conditions. The changing sets of condi- 
tions may require drastic changes in execution patterns. 
For example, a robotic system coming into contact with 
a hard surface may have to switch in a force control al- 
gorithm, along with its attendant sensor set, estimators, 
trajectory control routines, etc. 

ControlShelFs configuration manager directly sup- 
ports this type of radical behavior change; it allows en- 
tire groups of modules to be quickly exchanged. Thus, 
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different system personalities can be easily interchanged 
during execution. This is a great boon during develop- 
ment, when an application programmer may wish, for 
example, to quickly compare controllers (See Figure 5). 
It is also of great utility in producing a multi-mode sys- 
tem design. By activating these changes from the state- 
machine facility (see below), the system is able to handle 
easily external events that cause major changes in sys- 
tem behavior. 


Configuration Hierarchy The configuration man- 
ager essentially creates a four-level hierarchy of module 
groupings. Individual sample modules form the low- 
est level. These usually implement a single well-defined 
function. Sets of modules, called module groups^ com- 
bine the simple functions implemented by single mod- 
ules into complete executable subsystems. 

Each module group is assigned to a category. One 
group in each installed category is said to be active^ 
meaning its modules will be executed. Finally, a config- 
uration is simply a specification of which group is active 
in each category. 



Figure 5: Configuration Manager 

Configurations can be swapped in or out under program or 
menu control This provides flexible run-time reconfigura- 
tion of the execution structure. 


Example As a simple example, consider a system 
with two controllers: a proportional-plus-derivative con- 
troller named “PD”, and an optimal controller known 
as “LQG”. Suppose the PD controller requires filtered 
inputs, and thus consists of two sample modules: an in- 
stance of the PDControl component and a filter compo- 
nent. These two components would comprise the “PD” 
module group. The “LQG” controller module group 
may also be made up of several components. Both of 
these groups would be assigned to the category “con- 
trollers” . 


The user (or application code) can then easily switch 
controllers by changing the active module group in the 
“controller” category. 

Now suppose further that the controllers require a 
more sophisticated sensor set. A category named “sen- 
sors” may also be defined, perhaps with module groups 
named “endpoint” and “joint” . The highest level of the 
hierarchy allows the user to select an active group from 
each category, and name these selections as a configu- 
ration, Thus, the “JointPD” configuration might con- 
sist of the “joint” sensors and the “PD” controller. The 
“endptLQG” configuration could be the “endpoint” sen- 
sors and the “LQG” controller. 


Category and Group Specification This subdivi- 
sion may seem complex in these simple cases. However, 
it is quite powerful in more realistic systems. It has 
been shown to be quite natural in applications rang- 
ing from a vision-guided dual-arm robotic system able 
to catch moving objects [16] to flexible-beam adaptive 
controllers [27]. 

Assigning modules to groups and groups to categories 
is made quite simple with the ControlShell graphical 
DFE editor’s “configuration definition” window, shown 
in Figure 6. New categories are added with the click 
of a button. To create a module group, the user sim- 
ply names a group, and then clicks on the modules in 
the data-flow diagram that should belong to that group. 
The blocks are color-coded to relate the selections back 
to the user. 



Figure 6: Configuration Definition 

Configurations are easily defined within the DFE graphical 
interface. 
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6 Finite State Machines 

A real-time system in the real world must operate in 
a complex, event-driven environment. With only a se- 
quential programming language, the burden of manag- 
ing and reacting to events is left to the programmer. 

The Finite State Machine (FSM) module is designed 
to provide a simple strategic-level programming struc- 
ture that also assists in managing events and concur- 
rency in the system. The FSM module combines a 
non-sequential programming environment with natural 
event-driven process management. With this structure, 
the programmer is actively encouraged to divide the 
problem into small, independently executing processes. 

To utilize the FSM module, the programmer first 
describes the task as a state transition graph. The 
graph can be directly described within ControlShelPs 
graphical FSM editor (see Figure 7). Each transition — 
represented by an arrow in the graph — specifies a start- 
ing state, a boolean relation between stimuli that causes 
the transition, the CSModule to be executed when the 
transition occurs, and a series of “return code-next 
state” pairs that determine the program flow. 

The FSM model is quite general; it supports rule- 
based transition conditions (reducing the number of 
states in complex systems), true callable sub-chains of 
states (so libraries of state subroutines can be devel- 
oped), wild-card matching (so unexpected stimuli can 
be processed), global matching (allowing easy error pro- 
cessing), and conditional succession (so state programs 
may easily branch). Transitions are specified as boolean 
relations of three types of stimuli: transient, latched, 
and conditional. Transient stimuli have no value, and 
exist only instantaneously. Latched stimuli also have 
no value, but persist until some transition expression 
matches. Condition stimuli have string values; they per- 
sist indefinitely and thus represent memory in the sys- 
tem. Thus, the transition condition “Object = Visible 
AND Acquire” might cause a system to react to an ac- 
quisition command from a high-level controller. Provid- 
ing these three stimuli types allows combination of both 
“system status” and “event” types of asynchronous in- 
puts into easily-understood programs. 

The FSM module takes advantage of the atomic 
message-passing capability of modern real-time kernels 
to weave the incoming asynchronous events into a single 
event stream. Any process can call a simple routine to 
queue the event; the FSM code spawns a process to ex- 
ecute the resulting event stream. The result is an easy- 
to-use, yet powerful real-time programming paradigm. 

7 Data Control and Binding 

Most data in a ControlShell application is embodied 
in CSMats. A CSMat is a named matrix of floating- 
point values. Each row and column of the matrix op- 



Figure 7: Finite State Machine Editor 

State transition graphs allow easy visualization of multi-step 
operations; this example is a (simplified) program to catch a 
moving object with a dual-arm manipulator. 

tionally contains a field name and a units specification. 
A complete real-time matrix mathematics utilities pack- 
age is included. Components may combine multiple CS- 
Mats into structures for efficient reference and parame- 
ter passing. 

The entire control hierarchies are created and bound 
to the correct data objects at run-time. The system 
is built from the graphically-generated description files 
produced by the DFE and FSM editors. This dynamic 
binding paradigm is very powerful — it combines the con- 
venience of automatic system building with the flexibil- 
ity of a dynamically changeable system. Thus, it pro- 
vides the features of a full code generation without the 
pre-compiled inflexibility. 

To support this dynamic binding, ControlShell incor- 
porates a “linking” database facility. All instances of 
each data object (such as CSMats) — and each control 
construct (such as execution lists) — are entered into the 
database upon creation. The database allows “refer- 
ence before creation” semantics for many object types; 
if a requested object is not in the database (i.e. it does 
not exist), an incomplete (e.g. zero-sized) object will 
be created by the database itself. This capability allows 
considerable flexibility at run-time; modules may, for in- 
stance, specify dependencies on data sets that do not yet 
exist, etc. Verification routines insure that the system 
is consistent before actual “live” execution begins. 

8 Network Connectivity 

ControlShell is integrated with a network connectiv- 
ity package called the Network Data Delivery Service 
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(NDDS) [11]. NDDS is a novel network-transparent 
data-sharing system. NDDS features the ability to han- 
dle multiple producers, consumer update guarantees, 
notifications or “query” updates, dynamic binding of 
producers and consumers, user-defined data types, and 
more. 

The NDDS system builds on the model of information 
producers (sources) and consumers (sinks). Producers 
register a set of data instances that they will produce 
and then “produce” the data at their own discretion. 
Consumers “subscribe” to updates of any data instances 
they require. Producers are unaware of prospective con- 
sumers; consumers are not concerned with who is pro- 
ducing the data they use. Thus, the network configura- 
tion can be easily changed as required. NDDS is a sym- 
metric system, with no “special” or “privileged” nodes 
or name servers. All nodes are functionally identical and 
maintain their own databases. The routing protocol is 
connectionless and “quasi-stateless^”; all data producer 
and consumer information is dynamically maintained. 
Thus dropped packets, node failures, reconfigurations, 
over-rides, etc. are all handled naturally. 

This scheme is particularly effective for systems (such 
as distributed control systems) where information is of 
a repetitive nature. NDDS is an efficient, easy-to-use 
distributed data-sharing system. Figure 8 illustrates the 
use of NDDS within a cooperating-arms robot system 
(see [24]). 



Figure 8: Network Data Delivery Service 

NDDS provides a network ^backplane**. Each module can 
easily share data with any other module. The individual 
connections are handled transparently by the system. 


^The databases at each node cache some state for efficiency, 
but all information decays over time. 


9 Conclusions 

This paper has presented a brief overview of the ca- 
pabilities of the ControlShell system. ControlShell is 
designed — first and foremost — to be an environment 
that enables the development of complex real-time sys- 
tems. Emphasis, therefore, has been placed on a clean 
and open system structure, powerful system-building 
tools, and inter-project code sharing and reuse. 
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Abstract 

This paper presents an overview of the 
proposed Lyndon B. Johnson Space Center (JSC) 
precompetitive, dual-use technology investment 
project in robotics. New robotic technology in 
advanced robots, which can recognize and respond 
to their environments and to spoken human 
supervision so as to perform a variety of combined 
mobility and manipulation tasks in various sectors, 
is an objective of this work. In the U.S. economy, 
such robots offer the benefits of improved global 
competitiveness in a critical industrial sector; 
improved productivity by the end users of these 
robots; a growing robotics industry that produces 
jobs and profits; lower cost health care delivery 
with quality improvements; and, as these 
"intelligent" robots become acceptable through- 
out society, an increase in the standard of living for 
everyone. In space, such robots will provide 
improved safety, reliability, and productivity as 
Space Station evolves, and will enable human 
space exploration (by human/robot teams). 

The proposed effort consists of partnerships 
between manufacturers, universities, and JSC to 
develop working production prototypes of these 
robots by leveraging current development by both 
sides. Currently targeted applications are in the 
manufacturing, health care, services, and 
construction sectors of the U.S. economy and in the 
inspection, servicing, maintenance, and repair 
aspects of space exploration. But the focus is on 
the generic software architecture and standard- 
ized interfaces for custom modules tailored for the 
various applications allowing end users to 
customize a robot as PC users customize PC's. 
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Production prototypes would be completed in 
5 years under this proposal. 


1. Introduction 

This paper suggests a large number of 
opportunities for robotic manufacturers, integra- 
tors, potential buyers/users of robots, commercial 
technology developers, and universities to work 
with the NASA JSC Automation and Robotics 
Division, with NASA funding a major portion of the 
development. The focus is intelligent robotics as 
partial solutions to productivity problems in 
several sectors of application. The stage of devel- 
opment addressed is precommercial. In each case 
dual use is a prerequisite: there must be a space 
use as well as a nonspace, commercial use. 
Generally, this is easily the case. 

The specific motivation and rationale for this 
NASA JSC proposed technology investment pro- 
gram is detailed in Erickson 1. The general policy 
that sets the context for the NASA technology 
investment program, which will begin in 1994, is 
given in Clinton and Gore 2. 

It is important to understand that although a 
set of objectives, an approach, and a number of 
tasks are suggested here, these are meant to 
stimulate the creative thought process of those in 
nonaerospace and aerospace industry to propose 
objectives, approaches, and tasks that they believe, 
due to their involvement with their commercial 
buyers/users, would be economic and profitable as 
a result of jointly funded developmental efforts 
with NASA. 

Intelligent robotics is the use of robotic 
systems in solving problems in tasks and environ- 
ments where the robot's ability to acquire and 
apply knowledge and skills to achieve stated goals 
in the face of variations, difficulties, and complex- 
ities imposed by a dynamic environment having 
significant unpredictability is crucial to success. 

This means the robots can recognize and respond 
to their environments at the pace of their environ- 
ments and to spoken human supervision so as to 
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perform a variety of mobility and manipulation 
tasks. This does not require broad-based general 
intelligence or common sense by the robot. 

These robots are capable of significant, 
autonomous reaction to unpredictable events, yet 
are subject to optional human supervision during 
operation in a natural way, such as by voice. We 
refer to this capability in the supervised robot as 
"adjustable autonomy." Also, a key essence is that 
previously acquired knowledge is combined with 
knowledge acquired at the instant of task 
performance. 

The overall approach can be summarized as 
capitalizing on a software architecture that can be 
viewed as generic and modular, and hardware 
approaches that are modular, reconfigurable, and 
extendible. Many of the software modules, such as 
a deliberative planner, world model, and natural 
language interface, can also be viewed as generic. 
Other software is bundled with certain hardware; 
e.g., sensing software is bundled with specific 
sensor hardware. This leads to the concept of a 
modular, end-user customized robot, put together 
from modules with standard interfaces3>^>5 such 
as users do with a personal computer. An integra- 
ted computer aided concurrent engineering 
environment that we are working on 6 is a way to 
achieve close teamwork by geographically distrib- 
uted "virtual" teams to develop the production 
prototypes. 

JSC can be a key partner in this dual-use 
technology investment program in intelligent 
robotics for two reasons: (1) human space explor- 
ation missions require supervised intelligent 
robotics as enabling tools^-S and, hence, must 
develop or have developed supervised intelligent 
robotic systems and (2) intelligent robotic technol- 
ogy is being developed for space applications at 
JSC (but has a strong crosscutting or generic flavor) 
that is advancing the state of the art^- and is 
producing both skilled personnel and adaptable 
developmental infrastructure, such as low cost 
simulation environments for software testing and 
integrated testbeds for complete prototype 
testing. JSC also has a Small Business Innovative 
Research (SBIR) program n for intelligent robotics, 
which is underutilized and has no commercial cost 
sharing requirement. It is limited in scope to about 
$0.6 million and 2 years in Phase II efforts. 

A key element in the cutting edge intelligent 
robotics technology work at JSC is an understand- 
ing of and solution approach to the key issue of 
melding artificial intelligence planners with 


reactive capabilities. Artificial intelligence 
planners offer goal-achieving planning, but also 
high-time variance due to searching. Reactive 
capabilities are needed to deal safely in real time 
with dynamic, unpredictable environments at the 
paceof thedynamics^. Asecond key element that 
JSC brings is an approach to improved robotic reli- 
ability as required for space, but also useful in 
industry. A third key element that JSC brings to 
cutting edge technology is an understanding of 
and solution approach to the key issue of robotic 
safety while maintaining productivity. 

Of all these elements, the personnel skilled in 
the state of the art and knowledgeable about the 
technology are the most important. 


2. Overview of Proposed Activities 

New robotic technology in advanced robots 
that can recognize and respond to their environ- 
ments and to spoken human supervision so as to 
perform a variety of mobility and manipulation 
tasks in various sectors is an objective of this 
proposed effort. In the U.S. economy, such robots 
offer the benefits of improved global competitive- 
ness in a critical industrial sector; improved 
productivity by the end users of these robots; a 
growing robotics industry that produces jobs and 
profits; lower cost health care delivery with quality 
improvements; and, as these "intelligent" robots 
become acceptable throughout society, an 
increase in the standard of living for everyone ’2. 

In space, such robots will provide improved safety, 
reliability, and productivity as Space Station 
evolves, and will enable human space exploration 
(by human/robot teams). 

The proposed effort consists of partnerships 
between manufacturers, users, universities, and 
JSC to develop working production prototypes of 
these robots by leveraging current development by 
manufacturers and JSC. Currently targeted appli- 
cations are in the manufacturing, health care, 
services, and construction sectors of the U.S. 
economy and in the inspection, servicing, mainte- 
nance, and repair aspects of space exploration. But 
the focus is on the generic software architecture 
and standardized interfaces for custom modules 
tailored for the various applications, allowing end 
users to customize a robot as personal computer 
users customize PC's. Production prototypes would 
be completed in 5 years under this proposal, as 
would automated developmental environments 
and integrated testbeds. 
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JSC possesses the required core skills in its civil 
service and contractors to form the nucleus of the 
multiple partnerships. Current technology integra- 
tion efforts at JSC include the EVA helper/retriever 
supervised intelligent robot io, the mobile robotics 
testbed project, and the Soda-Pup entry in the 
AAAI national robotics competition (1992 award 
winner). In addition, JSC is responsible for 
engineering upgrades to the Shuttle Remote 
Manipulator Systems, integration of the Mobile 
Servicing Systems and Special Purpose Dexterous 
Manipulator into Space Station, and numerous 
robotics technology efforts. 

User coordination involves interested 
manufacturers with deployed robots. Joint facility 
sharing and temporary personnel exchange are 
possible. 

The overall set of activities has been grouped 
into the following seven related categories of 
tasks, each with its own objectives and approach. 

1. Problem-Solving Insertion of Robot 
Intelligence Technology 

2. Generic Intelligent Robotics Software 
Architecture 

3. Modular Manipulation and Mobility for 
Robotics 

4. Integrated Sensing and Perception Capabilities 
for Robotics 

5. Robotic Surrogates for Human Grasping and 
Manipulation 

6. Integrated Prototyping Environment for 
Robotics 

7. Robotic Applications in Advanced 
Manufacturing, Health Care, Service Industries, 
and Construction 

The following sections present the objectives, 
approach, and benefits for each of these catego- 
ries of tasks and give the titles of the set of tasks 
grouped into that category. One-page task 
descriptions are available 1 3 for all tasks, giving 
task objectives, proposed effort, major milestones, 
benefits, and other information. 


Problem-Solving Insertion of Robot Intelligence 
Technology 

The object! ves are ( 1 ) to work as a team with 
end user industries whose productivity problems 
can be solved by integrating adaptive robots into 
the advanced manufacturing or service process. 


and in so doing to develop a new paradigm of 
product line development for robot manufacturers 
and (2) to provide the robotics industry sensor/ 
software control techniques that will make the 
robots more flexible and attractive for use by end 
user industries. This will impact the end users of 
these robots by improving the end users' efficiency 
and productivity and thus improved global 
competitiveness. This will also stimulate robot 
demand and provide a new way of doing business 
for robot manufacturers. The benefits for space 
will be a healthier robotics industry capable of 
supplying quality robotics at lower costs. 

The proposed effort consists of partnerships 
between manufacturers, integrators, nonprofits, 
and JSC to solve end user problems by integrating 
adaptive robots into end user operations. As part 
of that effort the robot manufacturers' products 
must first be upgraded with sensing and intelligent 
reaction capabilities from new sensor/software 
control technology. A key product is the develop- 
ment, documentation, and refinement of the 
problem-solving insertion process for intelligent 
robotics technology, including end user problem 
identification techniques; problem selection 
criteria; requirements definition; development of 
a solution; integration with the end user people, 
processes, and equipment; user training; and 
continuing user support. 

In related work, JSC is responsible for 
integration of the Mobile Servicing System and 
Special Purpose Dexterous Manipulator into Space 
Station, which gives us the necessary experience 
and insight to help users. 

The eight tasks in this category are the 
following: 

• End User Target Problems Identification 

• End User Problem Selection 

• Selected Problem Requirements Definition 

• Design, Development, Test, and Evaluation of 
Solution 

• Integration with User Equipment, Processes, and 
People 

• User Training Definition 

• Continuing User Support Definition 

• Problem Solving Insertion Process Development, 
Documentation, and Refinement 

The benefits of this problem-solving insertion 
are that the end user businesses obtain a useful 
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solution to their problems. The robot manufactur- 
ers and integrators obtain a better understanding 
of the integration process, not only as part of the 
problem-solving insertion of their products but 
also as part of the requirements for capabilities in 
their products. JSC gets a benefit for space 
applications due to understanding of capabilities 
of intelligent robots required to solve certain types 
of problems. 

Generic Intelligent Robotics Software Architecture 

The objective is a generic, supervised 
intelligent robotics software architecture that 
provides a portable software approach that 
integrates intelligent planning and reactive 
control with sensing and internal representation of 
environment to enable advanced robots that can 
recognize and respond to their surroundings and 
to spoken human supervision in order to perform a 
variety of manipulation and mobility tasks. The 
benefits of such an architecture are the faster 
development time, lower cost, and increased 
adaptable and flexible performance. In turn, these 
provide improved productivity by the end users of 
these robots, whether terrestrial or space. 

The proposed effort consists of partnerships 
between manufacturers, nonprofits, and JSC to 
develop the software and evaluate its charac- 
teristics and robustness in several tasks and 
environments. The design of the software 
architecture, which is the framework (functional 
decomposition) that integrates the separate 
functional modules into a coherent system, is 
dictated in large measure by the tasks and nature 
of the environment. Because both the goal- 
achieving tasks and the partially unpredictable 
nature of the environments are similar on Earth 
and in space, the software architecture can be 
viewed as generic. Many of the software modules, 
such as a deliberative planner, world model 
capability, and natural language interface, can also 
be viewed as generic. Other software is bundled 
with certain hardware; e.g., sensing software is 
bundled with specific sensor hardware. 

Current work on the EVA helper/retriever 
supervised intelligent robot, the mobile robotics 
testbed project, and Soda-Pup project at JSC have 
provided us the necessary insight and experience. 
Not just any architecture will do here. It must solve 
the key issue of combining deliberative goal- 
achieving planning with reactive capabilities in 
such a way as not to li m it the i ntel I i gence of the 
planner or the safety of the reactive execution. 


The JSC work is believed to offer such a solution. It 
is a practical implementation of the mathematical 
theory of intelligent robots 

The ten tasks in this category are the 
following: 

• Artificial Intelligence Planning Software 

• Sequencing and Scheduling Software 

• Reactive Controller Software 

• Integration of Natural Language Understanding 
Into Architecture 

• Real-Time Speech Planning Software 

• World Modeling Software 

• Software Development Environment 

• Integrated Software Architecture 

• I ntegrated T esti ngAgainstSimul ated 
Environments 

• Skill Acquisition 

A generic software architecture for super- 
vised intelligent robots will enable portability and 
reuse, major time and cost savings in development 
and testing, more robust and higher quality 
software, and maintenance and training cost 
reductions. People will have a natural means of 
supervision by including task limited natural 
language understanding and speech generation 
software in the robotics software architecture. 
Improved safety of operations is also a benefit. 
These benefits apply in space and in the U .S. 
economy. 

Modular Manipulation and Mobility for Robotics 

The objective here is to develop a set of 
standardized modular components that can be 
reconfigured as required into modular robots 
offering a broad spectrum of tasks, reduced system 
costs, reduced weight, reduced mean time to 
repair, changeout of broken components, and 
reduced operator training. As components for an 
integrated prototyping environment for evalu- 
ating alternate approaches to design of robotic 
systems, these contribute to making adaptive 
robots "faster, better, and cheaper." 

The proposed effort consists of partnerships 
with robotic manufacturers, nonprofits, and 
universities to develop working production 
prototypes of a set of standardized modular 
components. The development of standards for 
mechanical and electrical connections and similar 


modular interfaces will be a product as well. Both 
manipulator and mobility systems with robot body 
structures would be developed. Arm sockets, links, 
joints, actuators, and sensors would be designed 
and developed to standards for manipulators. 
Wheels, tracks, suspension, drive train motors, 
gears, brakes, drive control electronics, structure, 
pan/tilt units, power and communications 
subsystems, and sensors would be designed and 
developed to standards for mobility and body 
systems. 

We have a current effort in designing 
modular components for space manipulation 4. 

The eight tasks in this category are the 
following: 

• Manipulator Socket, Link, Joint, Actuator, and 
Sensor Modular Component Standards 
Development 

• Manipulator Modular Component Designs 

• Manipulator Modular Component Development 

• Mobility Modular Component Standards 
Development 

• Mobility Modular Component Designs 

• Mobility Modular Component Development 

• Modular Robotics Testbed Development 

• Modular Prototype Testing on the Testbed 

Modular, reconfigurable manipulator arm 
and mobility subsystems as part of modular, 
reconfigurable intelligent robots will reduce cost, 
reduce development time and cost, enable more 
uses through reuse and reconfiguration, reduce 
maintenance and repair time and costs, and 
increase availability (uptime). This approach also 
enables low cost, rapid prototyping, and rapid 
development of intelligent robots with testing 
against intended tasks and environments to 
improve quality. All applications are beneficial, 
especially those for space. 


Integrated Sensing and Perception Capabilities for 
Robotics 

Development of the capability to select and 
tailor sensors and real-time perception processing 
to the task- and environment-driven requirements 
of adaptive robots is the objective of this portion 
of the effort. Perception is the extraction of useful 
information about the environment needed to 
understand the situation to complete the task 
successfully. 


These capabilities must be integrated with 
the interface standards of a generic, supervised 
intelligent robotics software architecture. These 
are the most important capabilities enabling 
reactive behavior and deliberative, goal-achieving 
planning and actions. By enabling advanced 
robots to recognize their dynamic environments so 
as to respond appropriately, this effort leads to 
improved productivity by the end users of these 
robots, a growing robotics industry that produces 
jobs and profits, and improved global competitive- 
ness. In space, these capabilities enable robots to 
provide the flexible support that enables space 
exploration (by human/robot teams). 

Integration of sensing and perception into 
planning and control in a robust way is a challenge 
for at least two fundamental reasons. First, the 
time available to sense and perceive the many 
dynamic and unpredictable elements of the 
situation is limited. Second, perception attaches 
meaning to the link between a conception of the 
environment and the objective environment. 
Perception is the process of inference that recog- 
nizes regularities in sensor data that are known on 
the basis of a model of the world to be reliably 
related to causal structure of objects and their 
relations in the environment and then conveys this 
to cognition. Sensory data underdetermines world 
structure; therefore, a model of world structure is 
required. 

Perception involves understanding generic, 
generally applicable models of world structure 
(not merely specific object models) and how that 
causal structure evidences itself in sensor data. 
Causal structure is of interest so as to be able to 
predict consequences, anticipate events, and plan 
actions so as to achieve goals. Perception is 
generally focussed by needs for information that 
supports planning and reasoning for goals 
achievement. Designing perception involves 
converting the understanding and inference 
processes into calculational steps (algorithms and 
inferences) and designing computation hardware 
systems to meet the requirements of information 
at rates and latencies required to deal with a 
dynamic environment. 

The proposed effort consists of partnerships 
between manufacturers, nonprofits, universities, 
and JSC to develop a set of sensors and perception 
processing appropriate to numerous task- and 
environment-driven requirements for adaptive 
robot applications, both in the U.S. economy and 
in space. Included here are vision sensing and 
visual perception, along with speech recognition 
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and task limited natural language understanding 
(speech perception). The unification of visual and 
speech perception is also included here. Proximity 
sensing, tactile/slip sensing, and force/torque 
sensing, which are critical aspects of many manipu- 
lation tasks, are addressed in the next category of 
tasks. 

Current sensing and perception efforts at JSC 
include focused developments for EVA helper/ 
retriever (laser scanner, stereo video, torque and 
proximity sensors, speech recognition and task 
limited, natural language understanding, etc.) and 
the mobile robotics testbed project (real-time 
stereo vision). 

The six tasks in this category are the 
following: 

• Vision Sensors and Sensing Software 
Development 

• Finding, Recognizing, Locating, and Tracking 
Objects and Humans 

• Visual Perception of Objects' Spatial Relations 

• Visual Perception of Objects' Condition and 
Process Participation 

• Speech Recognition and Natural Language 
Understanding 

• Unification of Visual and Speech Perception 

The benefit of sensing and perception 
capabilities is to enable the supervised intelligent 
robot both to extract needed information about 
the changing task environment, including humans, 
on a real-time basis so as to react safely and 
appropriately, and to build and continuously 
update internal representations of the changing 
environments so as to plan safe goal-achieving 
actions. People will have a natural means of 
supervision through task limited, natural language 
understanding software. The unification of visual 
and speech perception adds power to the human/ 
robot team. These benefits apply in space and in 
the U.S. terrestrial economy. 


Robotic Surrogates for Human Grasping and 
Manipulation 

Robots and humans must be capable of 
interacting with the same environment in terms of 
grasping and manipulation for certain tasks. 
Dexterous robotic grasping and manipulation 
capability must be developed to achieve this 
capability. The robot may operate in conjunction 
with a human as an apprentice or may be substi- 


tuted for a human (e.g., in hazardous operations). 
The benefits to the U .S. economy from robots with 
such capability would be very large: improved 
global competitiveness; improved productivity by 
the end users; a growing robotics industry mean- 
ing more jobs and profits; and an increased 
standard of living in the United States. In space, 
robots with these capabilities are required to 
interface with space hardware on astronaut/robot 
teams. This would reduce the cost of designing the 
robotic environment and allow more tasks to be 
done robotically. 

The proposed effort consists of partnerships 
between manufacturers and JSC to develop 
working production prototypes of human-scale 
versions of robot hands by leveraging current 
development by both sides. Integration of tactile, 
slip, force, and torque sensing; adaptive grasping; 
stable grasp recognition; and manipulation strat- 
egy approaches will be accomplished. However, it 
should be recognized that the resulting robot 
hands are not expected to be equivalent to human 
hands. Limited multitaskcapability isall thatis 
expected in the 5-year term of this effort. 

EVA helper/retriever and the dexterous, 
anthropomorphic robotic testbed (DART) are two 
of the current related efforts at JSC, as well as some 
SBIR developments. 

The nine tasks in this category are the 
following: 

• Hand designs 

• Integrated hand, wrist, and arm designs 

• Tactile/slip sensors, sensing software, and 
perception software 

• Proximity sensors, sensing software, and 
perception software 

• Force/torque sensors, sensi ng software, and 
perception software 

• Integrated sensing with hand, wrist, arm to 
provide stable grasp recognition and other 
intelligent functions 

• Grasping and manipulation strategies 

• Collision avoidance strategies 

• Compliance strategies 

Supervised intelligent robots and human 
ability to interact with the same environment in 
terms of dexterous grasping and manipulation will 
provide major benefits in U.S. industry, service 
applications, and in space. Costly special designs 
and structuring of the robot environment will be 
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minimized or eliminated, thus reducing costs. 
Robots will be able to operate in conjunction with 
humans as robot apprentices to humans on 
human/robot teams. 


An Integrated Prototyping Environment for 
Robotics 

The objective is to develop an integrated 
rapid prototyping and rapid development 
environment for building robotic systems "faster, 
better, and cheaper" based on modularity, 
reconfigurability, and extendibility, including a 
library of hardware modules {such as manipulators, 
tools, and sensors), complementary software 
modules (such as sensi ng and perception strategies 
and manipulator control), and software advisors 
designed to reduce the cost of programming 
robots. This effort would also leverage develop- 
ment of a generic intelligent robotics software 
architecture in a related subcategory. 

The proposed effort consists of partnerships 
between manufacturers and JSC to develop an 
integrated prototyping environment that will 
allow users to generate and evaluate alternate 
approaches to the design of a robotic system 
quickly. This effort would also leverage develop- 
ment of modular manipulation and mobility for 
robots and other related subcategories. 

Task-directed process design with a systems 
engineering focus and reconfigurable modular 
designs are strong points of our experience at JSC 
that are critical to success here. 

The five tasks in this category are the 
following: 

• Requirements for Prototyping Environment 

• Designof Prototyping Environment 

• Development of Prototyping Environment 

• Testingof Prototyping Environment 

• Knowledge Support System 

Automation of the process of designing and 
developing intelligent robots reduces costs and 
development time. Automation of the testing of 
intelligent robots also reduces costs and develop- 
menttime while providing the user early feedback 
thatthe robots will solve the problems. All 
markets benefit: advanced manufacturing, health 
care, service industries, construction, mining, 
space, etc. 


Robotic Applications in Advanced Manufacturing. 
Health Care. Service Industries, and Construction 

This effort's objectives are to enable the 
manufacture and marketing of supervised 
intelligent robotic systems for applications in 
advanced manufacturing, health care, service 
industries, and construction by developing 
working production prototypes. Production 
prototypes will also be developed for inspection, 
servicing, maintenance, and repair tasks for space 
exploration. Such advanced robotic systems offer 
the benefits of improved productivity by the end 
users and improved global competitiveness to the 
U.S. economy. In space, such robots provide 
improved safety, reliability, and productivity as 
Space Station evolves and enables human space 
exploration (by human/robot teams). 

The proposed effort consists of partnerships 
between manufacturers, nonprofits, doctors and 
hospitals, universities, and JSC. 

The required core skills are available at JSC in 
its civil service and contractors to form the nucleus 
of the multiple partnerships. Current technology 
integration efforts include the EVA helper/ 
retriever supervised intelligent robot, the mobile 
robotics testbed project, and the Soda-Pup entry in 
the AAAI national robotics competition. In 
addition, JSC is responsible for numerous 
applications of robotics. 

In our ongoing relationship with the Texas 
Medical Center, recent interest by Drs. Steve Kroll 
and Chuck Van Duren in robotic microsurgery and 
arterial catheterization has been shown. 

The fou r tasks i n th i s category are the 
following: 

• Robotic Applications in Advanced 
Manufacturing 

• Robotic Applications in Health Care 

• Robotic Applications in Service Industries 

• Robotic Applications in Construction 

The benefits of intelligent robots to advanced 
manufacturing are spelled out in detail in 
Erickson i . The benefit to health care is lower cost 
health care delivery with quality improvements 
due to improvements in productivity. The benefits 
to other service industry applications are improve- 
ments in productivity. The benefits of intelligent 
robots to construction include improved construc- 
tion time and productivity. 
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3 Concluding Remarks 

We have presented a "straw man" pre- 
competitive, dual use technology program in 
intelligent robotics intended to stimulate the 
creative aspects of nonaerospace and aerospace 
industry to propose their own objectives, 
approaches, and tasks for new jointly funded 
partnerships with NASA JSC. It is evidence of our 
"earnest" and that we are ready to proceed with 
our end of the partnershi ps. 
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